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1 . S This is aFIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. im This is aSECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371. 

3. Tliis is an express request to promptly begin national examination procedures (35 U.S.C. 371(5). 
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The US has been elected by the expiration of 19 months from the priority date (PCT Article 31). 
A cmy of the International Application as . pub! i .qhed . 

a. I I is attached hereto (required only if not communicated by the International Bureau). 

b. [x] has been communicated by the bitemational Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Ofice (RO/US). 
An English language translation of the International Application as filed (35 U-S.C. 37I(cX)). 
Amendments to the claims of the Internationa! Application under PCTT Article 19(35 U.S.C 371(c)(3)) 
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have been communicated by the International Bureau. 

c. □ have not been made; howevCT, the time limit for making such amendments has NOT expired. 
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An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C371Cc)(3)). 
An oatii or declaration of the inventor(s) (35 U.S.C. 37 1 (c)(4)). 

An English language translation of the annexes to the International Preliminary Examination Repcrt under 
PCT Article 36 (35 U.S.C. 371(c)(5)). 



Items 11 to 16 below concern docament(5) or information included: 

An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

An assignment document for recording. A separate cover sheet in compliance witfi 37 CFR 3.28 and 3.3 1 is included. 
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17. ELI The following fees are submitted: 

BASIC NATIONAL FEE ( 37 CFR 1^92 (a) (I) - (S) ) : 

Neither international preliminary examination fee.^37 C'PR 1.482) 
nor international search fee (37 CFR 1 .445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO *10«U.«0 

IntOTiational preliminary examination fee (37 CFR 1.482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO $860.00 

International preliminary examination fee (37 CFR 1 .482) not paid (o USPTO but 
international search fee (37 CFR 1.445(a)(2)) paid to USPTO $710.00 

International preliminary examination fee paid to USPTO (37 CFR L482) 

but all claims did not satisfy provisions of PCT Article 33(l)-(4) .$690.00 

Internationa] preliminary examination fee paid to USPTO (37 CFR 1.482) 

and all claims satisfied provisions of PCT Article 33(l)-(4> $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



ATTC«NEVS DOCKET NUMBER 



CALCULATIONS pto use only 



$ 860.00 



Surcharge of$130.00 for furnishing the oath or declaration later thanQ 20 D 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



6H4.00 



Total claims 



-20 = 



X $18.00 



Independent claims 



-3_= 



X $80.00 



$ 0 



MULTIPLE DEPENDENT CLAIM(S) (if applicable) 



+ $270.00 



$ 0 



TOTAL OF ABOVE CALCIJLATIONS = 



$ 1544.00 



Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above 
are reduced by 1/2. 



772.00 



SUBTOTAL = 



$ 772.00 



Processing fee of$130.00 for furnishing the English translation later than CD^O Qso 
months from the earliest claimed priority date (37 CFR 1 ■492(f)). 



$ 



TOTAL NATIONAL FEE = 



S 77?..nn 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment mat be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property 



$ 



TOTAL FEES ENCLOSED = 



$ 



772.00 



*PRELIHINARY AMENDMENT DELETING MULTIPLE 
DEPENDENCIES ENCLOSED: 



Amount to be 



chained: 



a. 



A check in the amount of $ to cover the above fees is enclosed. 



b. n Please charge my Deposit Account No . in the amount of to cover the above fees. 

A duplicate copy of this sheet is enclosed. 

c. H The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 12-0415 A duplicate copy of this sheet is enclosed. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pendin g status. 



SEND ALL CORRESPONDENCE TO; 
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Applicant or Patentee C orrado FQGHER 
Serial or Patent No. 09/7^3,823 



^Attorney's Docket No. B-4075PCT 618464-4 



Filed or Issued: 16 January 2001 



International Application No. PCT/IT99/00226 filed 19 July 1999 

For: " A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN LACTOFERRIN. VECTORS. 
CELLS AND TRANSGENIC PLANTS CONTAINING IT" 

Verified Statement (Declaration) Claiming Small Entity Status 

37 CFR 1.9(f) and 1.27(c) - Small Business Concern 

1 hereby declare that I am; 

{ 1 the owner of the small business concern identified below: 

i . ] an official of the small business concern empowered to act on behalf of the concern identified 

below: 

Name of Concern: PLANTECHNO S.r.l. 



Address of Concern: Via Staffolo 60 26040 Casalmagglore OR ITALY 

I hereby declare that the above identified small business concern qualifies as a small business concern as defined In 
13 CFR 121.3-18, and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees under Section 41(a) and 
(b) of Title 35, United States Code, in that the number of employees of the concern, including ttiose of its affiliates, 
does not exceed 500 persons. For purposes of this statement, (1) the number of employees of the business concern 
is the average over the previous fiscal year of the concern of the persons employed on a full-time, part-time or 
temporary basis during each of the pay periods of the fiscal year, and (2} concerrjs are affiliates of each other when 
either, directly or indirectly, one concern controls or has the power to control the other, or a third party or parties 
controls or has the power to control both. 

i hereby declare that rights under contract or law have been conveyed to and remain wiHi the small business concern 
identified above with regard to the invention, entitled: " A SYNTHETIC POLYNUCLEOTIDE C ODING FOR 
HUMAN LACTOFERRIN. VECTORS. CELLS AND TRANSGENIC PLANTS CONTAINING IT' 



by lnventor(s): 
described in: 
I ] 

[ ] 



Corrado FOGHER 



the specification filed herewflth. 

application serial no.: D9/7V3823 . filed 16 January 2001 
patent no. , issued 



If the rights held by the above identified small business concern are not exclusive, each individual, concern or 
organization having rights to the Invention is listed beloW, and no rigtits to the invention are held by any perscwi, 
other than the inventor, who could not qualify as a small business concern under 37 CFR 1.9(c) or by any concern 
which would not qualify as a small business concern under 37 CFR 1 .9(d) or a non-profit organization under 37 CFR 
1 .9(e). *NOTE: Separate verified statements are required from each named person, concern or organization having 
rights to the invention aven-ing to their status as small entities. (37 CFR 1 .27) 



NAME: 



ADDRESS 



[ ] INDIVIDUAL [ j SMALL BUSINESS CONCERN [ ] NONPROFIT ORGANIZATION 



NAME: 



ADDRESS 



[ ] INDIVIDUAL [] SMALL BUSINESS CONCERN E ] NONPROFIT ORGANIZATION 

1 acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of 
entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any 
maintenance fee due after the date on which status as a small entity Is no longer appropriate. [37 CFR 1.28<b)l 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of 
Title 18 of the United States Code, and that such willftjl felse statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which this verified statement (s directed. 



Name of Person Signing; 
TiHe of Person Other Than Owner; 
Address of Person Signing: 



Signature: 




Date: 1 ^tXiCM 2 0Q^ 
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Applicant or Patentee: Corrado FOGHER Attorney's Docket No. B-4075PCT 618484-4 

Serial or Patent No. 09/7«3.823 

Filed or Issued; 16 January 2001 

International Application No. PCT/IT99/00226 . filed 19 July 1999 

For "A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN LACTOFERRIN. VECTORS. 
CELLS AND TRANSGENIC PLANTS CONTAINING IT' 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY 
STATUS (37 CFR 1.9 (f) and 1.27(b)) - INDEPENDENT INVENT0R(S) 

As a below named inventor, I hereby declare that ! qualify as an independent inventor as defined in 37 CFR 1.9 (c) 
for purposes of paying reduced fees under section 41 (a) and (b) of Title 35, United States Code, to the Patent and 
Trademark Office with regard to the invention entitled " A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN 

LACTOFERRIN. VECTORS. CELLS AND TRANSGENIC PLANTS CONTAINING IT '1 

described in 

[ ] the specification filed fierewith. 

[ 1 application serial no.: , filed 



[X] International Application No. PCT/IT99/00226 . filed 19 July 1999 . and 
identified as U.S. Application No. 09/7^3.823 

i have not assigned, granted, conveyed or licensed and am under no obligation under contact or law to assign, grant, convey or 
license, any rights In the Invention to any person who could not be classified as an independent inventor under 37 CFR 1 .9 (c) if that 
person had made the invention, or to any concern wh\dt would not qualify as a small business concern under 37 CFR 1 .9 (d) or a 
nonprofit organization under 37 CFR 1 ,9 (e). 

Each person, concem or organization to which I have assigned, granted, conveyed or licensed or am under an obligation under 
contract or law to assign, grant, convey or license any nghts in the invention is listed below 



i^j [ ] no such person, concern, or arganizalion 

persons, tonterns or oi ganizatiuns listad below* 
:5 *NOTE: Separate venfied statements are required from each named person, concern or organi^tion having 

" 3 nghts to the invention averring to their status as small entities. (37 CFR 1 27) 



FULL NAME PLANTECHNO S r I. 

ADDRESS _Via Staffolo , 60 , 26040 Casalmaqoiore CR ITALY 

[] INDIVIDUAL [Xj SMALL BUSINESS CONCERN [] NONPROFIT ORGANIZATION 



FULL NAME . 
ADDRESS „ 



] INDIVIDUAL [ 1 SMALL BUSINESS CONCERN [ ] NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting Ir loss of entitlement to small 
entity status prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee due after the date on which 
atglds as a small snllfy is no longer appropriate {37 CFR 1 28 (b)) 

1 hgreby dec'.are that all statements made herein of my own Knovjiedge are true and that all statements made on infomiation and 
belief are believed to be true; and further that these slatements were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 16 of the United States Code, and that such 
wiiifuf felse sfafemente may jeopardize the validity of the application, any patent issuing thereon, or any patent to which this verified 
statement is directed. 



Corrado FOGHER . 

Name of First Inventor Sign^ajjif^of FirsfTnventor Date 
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IK THE UHITED STATES PATENT AND TRADEMZ^OC OFFICE 



Applicant: Gorrado FOGHER 

U.S. Appln. No.: not yet 
assigned 

U.S. Filing Date: concurrently 
herewith 

International Application No: 
PCT/IT99/00226 

International Filing Date: 
19 July 1999 

For: "A SYNTHETIC POLYNUCLEOTIDE 
CODING FOR mJMAN LACTOFERRIN; 
VECTORS. CELLS AND TRANSGENIC 
PLANTS CONTAINING IT" 



Re: Preliminary Amendment 
Group: not yet assigned 

Examiner: not yet assigned 



Our Ref.: B-4075PCT 618484-4 



Date: January 16, 2001 



Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 

Attn: United States Designated/Elected Office (DO/EO/US) 
Sir: 

Prior to examination of the above-identified application, it is 
respectfully requested that the following amendments be made to the 
Claims as amended during IPE: 



IN THE CLAIMS 



1. (amended) A polynucleotide coding for h-uman lactoferrin, 
[characterized in that it hasl havincr a sequence totally or partially 
corresponding to the sequence reported in the annexed sequence 
listing as SEQ ID N0:1, said sequence being optimized for the in 
plant expression. 

2. (unchanged) The polynucleotide according to claim 1, 
wherein said polynucleotide has fused at the 5 '-terminus the leader 
sequence of 7s basic globulin gene or the leader sequence of J^- 
conglycinine gene. 
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3. (unchanged) The polynucleotide according to claim 2, 
wherein said leader sequence is the sequence reported in the annexed 
sequence listing as SEQ ID NO: 13 or the sequence reported in the 
annexed sequence listing as SEQ ID NO: 14. 

4. (amended) A recombinant DNA vector comprising the 
polynucleotide according to [any of] claim[s] 1 [to 3] operatively 
linked to regulation elements allowing the expression of said 
polynucleotide . 

5. (amended) The recombinant DNA vector according to claim 4, 
wherein said regulation elements [is]are a plant expression cassette 
allowing the tissue specific expression of said polynucleotide. 

6. (unchanged) The vector according to claim 5, wherein said 
plant expression cassette includes the promoter of the gene coding 
for the protein basic globulin 7 S. 

7. (unchanged) The vector according to claim 6, wherein said 
promoter has the sequence reported in the annexed sequence listing 
as SEQ ID NO: 21 

8. (amended) The vector according to claim 6 [or 7], wherein 
said plant expression cassette includes the leader sequence of the 
gene coding for the protein basic globulin 7 S. 

9. (unchanged) The vector according to claim 8, wherein said 
leader sequence is the sequence reported as SEQ ID NO: 13. 

10. (unchanged) The vector according to claim 5, wherein said 
plant expression cassette includes the promoter of the gene coding 
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for S-conglycinine protein. 

11. (unchanged) The vector according to claim 10, wherein said 
promoter has the seqiience reported in the annexed sequence listing 
as SEQ ID NO:22. 

12. (amended) The vector according to claim 10 [or 11], 
wherein said plant expression cassette includes the leader sequence 
of the gene coding for the S-conglycinine protein. 

13. (unchanged) The vector according to claim 12, wherein said 
leader sequence is the sequence reported in the annexed sequence 
listing as SEQ ID NO: 14. 

14. (amended) A vegetal cell including the polynucleotide 
according to [any of] claim[s] 1 [to 3] . 

15. (amended) A vegetal cell including the vector according to 
[any one of the] claim[s] 4 [to 13]. 

16. (amended) A cellular aggregation obtainable from cells 
according to claim 14 [or 15] . 

17 . (unchanged) The cellular aggregation according to claim 
16, whererin said aggregations are calluses capable of regenerating 
transgenic plants. 

18. (amended) A transgenic plant including in a tissue cell 
the polynucleotide according to [any of] claim[s] 1 [to 3]. 

19. (unchanged) The transgenic plant according to claim 18, 
wherein said tissue cell is a storage tissue cell. 
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20. (unchanged) The transgenic plant according to claim 18, 
wherein said tissue cell is a fruit tissue cell. 

21. (amended) The transgenic plant according to [any of] 
claim[s] 18 [to 20], said plants being selected from the group 
consisting of solanaceae, cereals, leguminosae, frurt bearing plants 
and horticultural plants. 

22. (unchanged) The transgenic plant according to claim 21, 
said plant being selected from the group consisting of soya, tobacco 
and rice. 

23. (amended) [Use nfl A me^thod of using the vector according to 
[any of] claim[s] 4 [to 13] for the transformation of vegetal cells. 

24. (amended) [Use nf^ A method of using the transgenic plant 
according to [any one of the] claim[s] 18 [to 22,] for the 
production of nutriceufcicals. 

''25. (amended) [Use nfi A method of using the transgenic plant 
according to [any of] claim[s] 18 [to 22,] for the production of 
human lactoferrin. 

26. (amended) [Use nfl A method of using the transgenic plant 
according to [any one of] claim 18 [to 22,] for the production of 
lactoferrin flours or of lactoferrin extracts obtained from tissues 
of said transgenic plant. 

27. (amended) [Use of 1 A me^thod of using the transgenic plant 
according to [any one of the] claim[s] 18 [to 22] for the production 
of functional foods containing lactoferrin. 
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28. (amended) The fusel method according to claim 27, wherein 
said functional food being selected from the group consisting of 
vegetal milks, fruit juices, fruit and/or vegetable homogeneized 
foods . 

29. (unchanged) A plant expression cassette allowing the 
tissue specific expression of a gene of interest comprising the 
promoter of the gene coding for the protein basic globulin 7 S. 

30. (unchanged) The plant expression cassette according to 
claim 29, wherein said promoter has the sequence reported in the 
annexed sequence listing as SEQ ID NO: 21. 

31. (amended) The plant expression cassette according to claim 
29 [or 30], wherein said plant expression cassette includes the 
leader sequence of the gene coding for the protein basic globulin 7 
S. 

32. (unchanged) The plant expression cassette according to 
claim 31, wherein said leader sequence is the sequence reported as 
SEQ ID NO: 13 . 

33. (amended) A recombinant DNA vector comprising a gene of 
interest under the control of the plant expression cassette 
according to [any of] claim[s] 29 [to 32] . 

34. (amended) The vector according to claim 33 [when depending 
on claim 31 or 32.1 wherein said plant e :Kr)ression cassette includes 
the leader sequence of the gene codin g for the protein basic 
globulin 7 S and wherein said gene of interest is fused to the 
leader sequence. 
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35. (unchanged) A plant expression cassette allowing the 
tissue specific expression of a gene of interest comprising the 
promoter of the gene coding for the S-conglycinine protein. 

36. (unchanged) The plant expression cassette according to 
claim 35, wherein said promoter has the sequence reported in the 
annexed sequence listing as SEQ ID NO: 22. 

37. (amended) The plant expression cassette according to claim 
35 [or 36] / wherein said plant expression cassette includes the 
leader sequence of the gene coding for the [leader sequence of the 
gene coding for the] Jl-conglycinine protein. 

38. (unchanged) The plant expression cassette according to 
claim 37, wherein said leader sequence is the sequence reported as 
SEQ ID NO: 14. 

39. (amended) A recombinani: DNA vector comprising a gene of 
interest under the control of the plant expression cassette 
according to [any of] claim[s] 35 [to 38] . 

40. (amended) The vector according to claim 39 [when depending 
on claim 36 or 37,l wherein said plant expression cassette includes 
the leader sequence of the gene coding for the K-conalvcinine 
protein and wherein said gene of interest is fused to the leader 
sequence. 

41. (amended) A vegetal cell including the vector according to 
[any one of the] claim[s] 33[, 34, 39 or 40]. 



42. (unchanged) A cellular aggregation obtainable from the 
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cell according to claim 41. 

43 . (unchanged) The cellular aggregation according to claim 
42, said aggregations being calluses capable of regenerating 
transgenic plants . 

44. (amended) A transgenic plant including in a tissue cell 
the vector according to [any of] claim[s] 33 [, 34, 39 or 40]. 

45. (unchanged) The transgenic plant according to claim 44, 
wherein said tissue cell is a storage tissue cell. 

46. (amended) The transgenic plant according to claim 44, 
wherein said tissue cell[s] is a fruit tissue cell. 

47. (amended) The transgenic plant according to [any of] claim 
[43 to 451M, said plant [s] being selected from the group consisting 
of solanaceae, cereals, leguininosae, fruit bearing plants and 
horticultural plants. 

48. (unchanged) The transgenic plant according to claim 47, 
said plant being selected from the group consisting of soya, tobacco 
and rice. 

49. (amended) [Use ofi A method of using the vector [s] 
according to claim[s] 33 [, 34, 39 or 40] for the transformation of 
vegetal cells. 

Please add the following new claims: 

50. A vegetal cell including the vector according to claim 39. 
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51. A cellular aggregation obtainable from the cell according 
to claim 50. 

52. The cellular aggregation according to claim 51, said 
aggregations being calluses capable of regenerating transgenic 
plants . 

53. A transgenic plant including in a tissue cell the vector 
according to claim 39. 

54. The transgenic plant according to claim 53, wherein said 
tissue cell is a storage tissue cell. 

55. The transgenic plant according to claim 53, wherein said 
tissue cell is a fruit tissue cell. 

56. The transgenic plant according to claim 53, said plant 
being selected from the group consisting of solanaceae, cereals, 
leguminosae, fruit bearing plants and horticultural plants. 

57. The transgenic plant according to claim 56, said plant 
being selected from the group consisting of soya, tobacco and rice. 

58. A method of using the vector according to claim 39 for the 
transformation of vegetal cells. 

REMARKS 

The claims in this application were amended during International 
Preliminary Examination (IPE) . A copy of the amended and claims 
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attached. This Amendment is directed to the claims as amended 
during IPE and not to the claims as filed. Thus the amendment made 
during IPE must be entered before this amendment is entered. This 
Amendment amends Claims 4, 8, 12. 14-16, 18, 21, 23-27, 31, 33-34, 
37, 39-41, 44, 47, and 49 so that they are no longer multiply 
dependent. Claims 50-58 have been added to provide similar 
coverage. The Applicant may elect to amend Claims 4, 8, 12, 14-16, 
18, 21, 23-27, 31, 33-34, 37, 39-41, 44, 47, and 49 to make them 
again multiply dependent or to add additional claims to this 
application to provide coverage similar to, broader than or narrower 
than the present claims once examination on the merits has begim. 

This Amendment also amends claims 23-28 and 49 to eliminate non- 
statutory "use" claims. 

The Examiner should ensure that the amendments made by the IPEA are 
entered together with the amendments set forth above. 



Respecj&iTIily submitted. 




Richard P. Berg 
Reg. No. 28,145 
Attorney for Applicant 
LADAS Sl parry 

5670 Wilshire Boulevard #2100 
Los Angeles, California 90036 
(323) 934-2300 



Enclosed: copy of Claims as amended during IPE (5 pages) 
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A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN LACTOFERRIN, 
VECTORS, CELLS AND TRANSGENIC PLANTS CONTAINING IT 

DESCRIPTION 
Field of the invention 

The present invention relates to the field of the 
vegetable biotechnologies, and in particular to the 
plants and vegetal cells transformation techniques and 
systems, to cells and transgenic plants thus generated, 
and to their use. 

State of the art 

In recent years, due to the large variety of 
applications deriving from the utilization of the genetic 
engineering techniques in the biology of the vegetals, 
the use of genetically modified plants has gradually 
increased. 

In particular, the development of techniques for the 
transformation of plants into organisms capable of 
producing proteins of commercial interest has acquired a 
remarkable practical importance. 

In fact, the generation of recombinant plants 
containing a heterologous gene of interest, and their use 
in production processes on an industrial scale, allows to 
overcome a series of drawbacks characterizing the 
production systems presently in use, particularly those 
based on cell cultures. 

Indeed, recombinant DNA technology allowed the 
generation of transgenic cells that are used in the 
production of heterologous proteins of interest . In 
particular, .animal {and specifically mammal) cell 
cultures allow the production of proteins of interest, 
even extremely complex ones, in their native form, but 
the related process is extremely expensive, as 
hectoliters of cells are required for the production of 
an amount of proteins sufficient for a commercial {e.g. , 
pharmacological or alimentary) use {Stowell et al . , 
1991) . 

An alternative in this direction is then provided by 
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the production, carried out in prokaryotic cell cultures, 
a cheaper process, meeting however a serious obstacle in 
the inability of those systems to effect the post 
transcriptional and post-translational modifications 
required for the expression of complex heterologous 
proteins, carried out only by the eukaryotic systems 
(Click and Pasternak, 1994) . 

Therefore, a possible solution was pursued in the 
transformation of complex eukaryotic systems which could 
ensure at the same time the production of active proteins 
and the utilization of inexpensive processes. (Watson et 
al. 1992). 

In particular, plants possess the required 
potential, and their capacity of functioning as 
bioreactors for the production of complex proteins 
(plants are highly efficient in this type of processes, 
ensuring a high degree of expression) , in a cost- 
effective way (plant cultivation is relatively 
Inexpensive) and with a high yield (each single harvest 
can yield high amounts of proteins ) was proved ( Fraley 
and Schell, 1991) . 

Further, very many plants satisfy the important 
requirement of the non-allergenicity needed for the 
production systems of recombinant proteins of 
pharmacological or alimentary int'erest . In fact, the 
organisms used in those productions must belong to the 
GRAS (Generally Recognized As Safe) organism category, 
i.e. organisms that having been used by man for a long 
time, are considered safe for man and for the ecosystem 
as well. Obviously, plants such as Leguminosae, cereals, 
tobacco, horticultural plants in general and fruit trees, 
satisfy this requirement in nature. Among leguminosae. 
Soya is a basic food in the diet of many populations 
mostly of the Third World Countries, but recently in 
European Countries as well. In fact. Soya-derived 
compounds constitute ingredients that are present in a 
vast number of food products, such as e.g., lecithin and 
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seed oil, while the seeds of this Leguminosa yield a 
flour that is employed in various food like soybean milk 
containing approximately the 8% thereof. 

Therefore plants, while being the raw material from 
which the product can be extracted according to 
conventional processes, at the same time constitute an 
alternative to the conventional production. In fact, they 
can be used as functional foods, i.e. foods genetically 
modified so as to be enriched from a nutritional point of 
view, and in case assuming important properties as a 
natural drug. 

Thus the step of the protein extraction from the 
plant has been eliminated, and the heterologous protein 
expression does not prelude to an extraction and 
purification thereof, steps that account fo;i: most of the 
production cost of a drug, but simply enriches a 
vegetable nutriment, which thus becomes a nutriceutical , 
i.e. a nutriment having a pharmaceutical value. 

This is the reason for the research efforts aimed at 
generating genetically modified plants, optimized for the 
above-mentioned applications. 

However, to date the evaluation of the heterologous 
expression capacity typical of various plant species was 
exclusively performed under laboratory conditions, or 
anyhow on relatively limited surfaces in hothouse. 

For instance, expression in plants of enkephalin and 
human serum albumin, - as well as of mice monoclonal 
antibodies was studied (Watson et al. , 1992) . More 
recently, always referring to proteins relevant from a 
pharmacological point of view, two additional human 
proteins of therapeutic importance, i.e. active 
interleukin-6 and C protein (a serum anticoagulant) , were 
successfully expressed in tobacco. In all these cases the 
model plant on which the functionality of the prepared 
gene was tested is tobacco, whereas usually the plant 
finally selected for production is a leguminosa, whose 
seed content in storage proteins is high. 
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These experiments proved that often the expression 
levels of the heterologous protein in the vegetal 
bioreactors do not reach high enough levels to meet the 
commercial demands, and that anyhow it can be improved 
applying new information on the plant gene control. 
Specifically, it was demonstrated that the level of 
produced recombinant protein has to reach more than 1% of 
the total protein amount in order to become economically 
significant, a level obviously not achieved by the simple 
introduction of the heterologous gene, whose expression 
therefore needs to be maximized. 

In relation to the vegetal cell biology, in order to 
maximize the level of the in plant expression, an action 
on various levels is necessary: increase of genie 
transcription, increase of transcript stability and of 
the translation process yield. Moreover, it is further 
necessary to fix the inserted gene and to minimize the 
risk of the occurrence of silencing or of genie 
inactivation. All these factors are crucial in assessing 
the in plant expression level of the heterologous gene, 
that, as afore stated, apart from some exceptions usually 
settles at rather low levels (Owen and Pen, 1996) . 

Among all these, the most crucial factor, together 
with the transcription level depending on the 
preferential presence of certain codons, would seem to be 
the stability of the recombinant protein in the 
heterologous host, as the likely probable cause of its 
easy elimination. The relative instability might be the 
consequence, on one hand of the inability of the 
translated heterologous protein to assume a stable 
structural conformation, on the other hand of the 
ultrastructural compartment in which it is directed after 
the translation, where the presence of proteases and of 
particular pH values determine its degradation. 
' Accordingly, it is therefore important both to 

provide the modification of the heterologous sequences in 
order to ensure the codon optimization and to carry out a 
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careful selection of the targeting sequences , capable of 
directing the translated protein into preferential 
ultrastructural compartments, as i.e. the seed storage 
tissue, capable of ensuring the stability of the product. 
Concerning the latter issue, the best strategy of action 
in this sense, is that of providing the maintenance of 
the endogenous signal sequences of the plant selected for 
the final production. In fact, the adoption of these 
sequences prevents the alteration of the cell internal 
balance, consequence of the unavoidable random 
accumulation of exogenous proteins that would take place 
in their absence. In this regard, the fact that the 
ultrastructural compartments have different 

characteristics in the cells of the various tissues has 
to be taken into account. 

For instance, the heterologous proteins accumulated 
together with the storage proteins of the seeds in the 
transgenic plants, are more stable as compared with those 
of the vegetative organs (Owen and Pen, 1996) . Probably, 
the reason of such different stability can be found in 
the different protease activities recorded in the 
vacuoles of the leaf meristem cells, with respect to 
those observed in the vacuoles of the seed. It is 
therefore understandable how research efforts are aimed 
at the generation of transgenic "plants in which the 
heterologous protein be preferably expressed in the seed 
(Owen and Pen, 1996). 

Furthermore, overall the seed constitutes the 
vegetal organ most used by man for its ease of 
conservation and, obviously, for its caloric and protein 
contribution. The seed consists of the plant embryo 
enveloped by storage tissues that provide energy and 
nitrogen during germination. Usually the storage tissue 
function is mainly effected by the endosperm, but in 
Legiiminosae, as e.g. Soya, the cotyledons develop 
remarkably, and acquire this function. 

The storage function for- the nitrogen component is 
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carried out by particular proteins accumulated in the 
protein bodies, inside compartments in the endocellular 
membrane systems , The amount of proteins in the seed 
varies from about 10-15% in cereals to about 25-35% in 
Leguminosae, therefore seeds are an important protein 
source in the human diet (Shewry et al., 1995), 

In order to exploit the expression system of the 
seed storage proteins, first of all it has to be 
considered that the storage proteins of all plants have 
some functional and physiological common characteristics: 
their synthesis is controlled during the seed ripening 
according to the nutritional needs, and they are stored 
in protein bodies. In particular, the Leguminosae storage 
proteins are divided in two classes: globulins and 
lectins. Globulins are the most widespread storage 
proteins, present not only in leguminosa but in 
monocotyledons and in other dicotyledons as well . The 
globulin class in turn is subdivided into two subclasses: 
legimins (lis hexameric proteins) and vicilins (7S 
trimeric proteins) . Also p-conglycinine and basic IS 
globulin, whose regulation elements were used to perform 
tissue-specific expression in seed (Shewry et al., 1995) 
belong to the vicilin subclass, but whereas the former 
was studied in detail, no detailed information is 
available on the basic globulin functioning. 

p-conglycinine is a storage protein of the Soya seed 
(Glycine max.), consisting of three different subunits, 
a, a\ p that interact non-covalently to form a trimer 
complex. The subunits are coded by a multigene family of 
15 genes grouped in six nuclear DNA regions, whose 
expression is strictly regulated so as to be modulated 
during the plant life (Harada et at. 1989) . Control is 
tissue-specific as well as stage-specific. In fact, the 
expression of each subunit is activated at high levels at 
the moment of embryo development, from the heart shaped 
phase until complete ripening, whereas it is repressed 
before the dormancy phase. 
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Moreover, expression occurs exclusively on specific 
plant zones, like cotyledons, according to an expression 
pattern that varies in the course of their ripening; at 
first it is activated on the outer cotyledon area, then a 
wave distribution from the outside to the inside is 
observed and lastly it turns out to be uniformly spread 
over the entire cotyledon tissue. However, during the 
heart shaped phase (18 days after pollination), the gene 
encoding the p-conglycinine also expresses itself in the 
embryonal axis, whereas -it does not express in endosperm, 
tegraen and in already differentiated tissues (Perez-Grau 
and Goldberg, 1989) . The same behavior also occurs in the 
seed of transgenic plants belonging to other families, as 
e.g., tobacco, proving the same control mechanism to be 
functional in solanaceae as well (Naito et al. 1988; 
Perez-Grau and Goldberg, 1989) . 

The regulation of subunits a/a * and |5 genie 
expression occur at a transcriptional as well as a post- 
transcriptional level {Harada et al., 1989). a^ subunit, 
of 7 6 Kd, with a 2-5 Kb mRNA, is accumulated more 
precociously and in a larger amount as compared to p 
subunit. This behavior is due to the greater strength of 
the a* subunit due to the presence of an enhancer region, 
absent in p, and of a sequence stabilizing the a' 
transcript, it also absent in p (probably a 560 pb region 
in the first exon of the a' transcript) (Harada et al., 
198 9) . This difference in expression has also been 
highlighted in transformed tobacco plants {Bray et al., 
1987) . 

In contrast with the a' subunit, the expression 
level of the p subunit, is also influenced by abiotic 
stresses, methionine level in the ground and presence of 
ABA. The base elements involved in gene control at 
transcriptional level for the a' subunit are clustered in 
the 905 pb region at 5' of the transcription start site, 
region called URS {Upstream Regulatory Sequence) . Inside 
this area specific sequences functioning as site-specific 
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enhancers have been detected. Among these, the legumin 
boxes {5'-CATGCAC.3' and 5' -CATGCAT-3' ) , elements that 
are found in many other genes encoding seed-specific 
proteins, in particular in legumes. A coordinated action 
of the two sequences determines a 10-fold increase of the 
seed gene expression level. The regulation by the above 
elements seems to be of a positive kind (Chamberland et 
al. 1992), however so far trans-elements specifically 
interacting in those sites have not yet been found. Site- 
specific expression also requires the coordinated action 
of elements operating in cis, not yet characterized, 
located in the region at 5' of the legumin boxes and at 
3' of the promoter (Chamberland et al., 1992). Probably 
the region at 5' includes a negative control site, 
specific for a nuclear factor present only in non-seed 
tissues . This factor would determine gene expression in 
embryonal tissues only. The importance of CAAT and TATA 
sequences in the control of site-specificity has also 
been proved. 

Four Soya nuclear factors that interact with 
specific sequences present in the a' and p URS of the 
gene have been identified as well. Two of those embryonal 
factors, SEF3 and SEF4, bind to sites inside the enhancer 
region (from -257 to -79 ). SEF3 binds to the middle of 

the sequence 5'-AACCCA .AACCCA-3' , present 

exclusively in the gene encoding the a' subunit . 
Accumulation and degradation of this protein < SEF3 ) is 
paralleled by accumulation and degradation of a' mRNA, 
supporting the hypothesis that SEF3 be involved in the 
control of a' expression. As compared to SEF3, SEF4 is 
present in a lesser amount, has many recognition sites 
(5'-A/GTTTTTA/G~3' ) in a', but mostly in p. The presence 
of this factor is correlated to the regulation of P 
expression (Lessard et al. , 1991) . Deletion and site- 
specific mutagenesis experiments have proved that the 
sole action of these factors does not affect the site- 
specificity nor the 'expression level, coordination with 
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the activity of the other regulatory elements being 
necessary. However, on the basis of the homology with 
light-induced proteins, these proteins are supposed to 
have a regulating role only under certain conditions 
(Fujiwara and Beachy, 1994} . 

Embryonal factors with a behavior similar to the 
SEF3, as verified in gel shift experiments, have been 
found in tobacco as well (Lessard et al., 1991). This and 
other data obtained with GUS activity assays under 
control of Soya a' promoter prove that the site- and 
stage-specific control mechanism is conserved (Lessard et 
al, 1991; Riggs et al . , 1989). It has also been 
hypothesized that tobacco trans factors binding to 
35SCaMV promoter may also interact with the legumin boxes 
(Katagiri et al., 1989). None of the afore mentioned 
factors appears to be directly responsible of the time 
regulation, and no NRS-lilce factor has been found 
possessing a negative control as in the case . of bean 
(Bustos et al. 1991) . 

More recent studies, concerning some legumins, and 
vicilins In Vicia Faba, contradict some generalizations 
on the regulation of the storage proteins expression in 
seeds (Wobus et al., 1995), showing that expression of 
genes of B4 and LeB4 Vicia Faba legumins is not limited 
to embryonal tis'sues, nor are they temporally restricted 
to the cell expansion phase in embryogenesis . Proteins 
are stored for short periods of time and then degraded in 
all embryonal tissues, suspensor and endosperm included, 
within well-defined developmental stages. This is so 
probably in order to allow an uninterrupted supply to the 
embryo of compounds having a high C and N content during 
all the growth and differentiation stages . Therefore, 
this data allow to hypothesize that the seed proteins 
expression be also controlled metabolically, and not 
merely at a developmental stages level. The possible 
relationship existing between storage proteins 
accumulation and carbohydrate metabolism (soluble glycid 



wo 00/04146 



PCT/n'99/00226 



10 

level) is presently being investigated. Since all classes 
of seed storage proteins share a similar behavior in the 
different species, this data require a careful evaluation 
of the behavior , in terms of expression, also for the 
Soya proteins {3-conglycinine and IS basic globulin. Data 
resulting from the study, which the present invention is 
based on, clearly show the tissutal specificity of the 
expression of the structural portion of lactoferrin under 
control of both the promoters. Instead, the activation 
phase was at the present not investigated in detail as 
the sole capability of total seed accumulation was of 
interest. Specifically concerning the 73 basic globulin, 
it is a storage protein of the Soya seed, with a high 
methionine and cysteine content. Alike |3-conglycinine, 
also IS basic globulin (Bg) is stored in seed in large 
amounts {3% of seed total proteins) . It consists of two 
subunits , one of 27 KDa, the other of 15 KDa encoded by 
the same mRNA, linked by disulfide bridges . Bg is 
synthesized as sole precursor polypeptide consisting of a 
putative peptide signal and of two subunits. This 
polypeptide is processed to yield the mature dimeric 
protein. In the genome about four copies of the Bg gene 
are present {Watanabe and Hirano, 1994) . 

This protein is mainly located in the seed embryonal 
tissues and its expression pattern is unusual for a 
storage protein. In fact, a portion of Bg is accumulated 
in the intercellular spaces of the cotyledon parenchyma 
(Nishizawa et al., 1994), whereas at an intracellular 
level it is stored in protein bodies on the middle 
lamella of cell wall and in the plasma membrane (Watanabe 
and Hirano, 1994) . This location suggests that the Bg is 
not a mere storage protein, having other functions as 
well . More accurate data on Bg location and expression 
period in Soya are not available . It has never been 
verified whether the site- and time-specific expression 
mechanism be preserved in other transformed vegetal 
species (like tobacco) . To this end, reference is made to 
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general data on storage proteins and to studies on Bg 
homologues in lupine (conglutyn y) , with which it has a 
high sequence homology. This protein is stored only in 
lupine embryonal tissues (cotyledons and embryonal axis) 
40 days after blooming. It has not been detected in other 
tissues such as leaves and sprouts . In seeds of 
transgenic tobacco, the conglutyn y gene is increasingly 
expressed from the 15th to the 20th day after blooming 
until the 40th, then begins to decrease (Ilgoutz et al . , 
1997) . 

One of the peculiar features of the Bg is that it is 
secreted in large amount from Soya seeds soaked in water 
at 40-60*'C. It is uncertain whether the secreted proteins 
are neosynthetized after heat-treatment, or instead are 
the proteins already present to be secreted. Since a 
post-heating increase in specific mRNA has been 
highlighted, it is assumed that the Bg is actually 
synthesized as a consequence of the thermal shock {Hirano 
et al. , 1989) . 

Not much is known on the regulation mechanism of the 
expression of the gene encoding the Bg protein, 
nevertheless sequences in the promoter region involved in 
the gene regulation have been identified. Besides the 
CAAT and TATA box sequences, respectively located at -116 
and -25 with respect to the transcription start site, 
three regulatory elements similar to thermo-specif ic 
sequence enhancers present at the non transcribed 5' 
region of genes in other organisms , have been detected . 
These heat shock elements {HSE) consist of two 5 pb 
conserved units: 5'-NGAAN-3' and 5'-NTTCN-3'. In the 
thermoregulated promoter of the Soya heat-shock protein, 
the enhancer elements, observed also in Bg as well, 
cooperate synergistically with three CCAAT box sequences 
located upstream thus increasing gene expression; these 
putative sequences are present also in the bg promoter. 

Sequences responsible of the site- and time- 
specificity expression were not identified. 
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The interest for this protein derives from the fact 
that it is accumulated in high amounts in the Soya seed 
(3% of total proteins) and therefore has a strong seed- 
specific promoter which can ensure a high level of 
expression of the gene it controls . Moreover, it is known 
that the regulation mechanism of this protein is 
different from that of the other storage proteins of Soya 
seed but the details are not known . However, studies on 
the promoter and on its site- and time-specific 
activation mechanism have never been carried out using 
reporter genes in transgenic plants. 

Both Bg and CONG, as storage proteins, are 
synthesized exclusively in the seed tissue and are stored 
in large amounts in cells constituting this organ, inside 
specific compartments. Concerning the post- 

transcript ional and above all the post trans lat ional 
regulation level, it runs through the mechanisms of 
intracellular transport and of protein 

compartimentalization, which are to date to be clarified 
in many aspects. 

In fact, those mechanisms involve all processes 
influencing the concentration, retention and distribution 
of the proteins in the endomembrane system (Okita, 1996) . 

However, general principles of protein targeting do 
exist, valid for all plant species.- 

1, Targeting information are contained in the 
proteins themselves, as discrete signals. Those sig^iais 
are intercepted by specific recognition signals such as 
receptors or simple interactions with membrane lipids. 

2 . Different types of signals do exist (topogenic 
sequences j each with a specific function . Among them 
there are signal sequences that start the protein 
translocation across specific endomembranes and interact 
with receptors/translocators facilitating the 
unidirectional transfer. Then there are stop and 
retention sequences that block the transfer to the 
membrane or to the inside of the compartment . Selection 
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sequences target proteins to the various cellular 
compartments. All those elements can be of a sequential 
type, i,e. localized in the N-terminus, central or C- 
terminus portion of the protein, or conformational, i.e. 
consisting of amino acids which although nonsequential, 
are yet adjacent in the native protein conformation. 
Moreover, there may be various signals simultaneously, 
and they can be modified or activated (e.g. by 
phosphorylation) . After transfer the signal is often 
deleted using specific cleavage sites for endogenous 
proteases . 

mRNA accumulation in a particular region also 
influences the protein location . Soya seed storage 
proteins, globulins as well as lectins, are stored in 
storage vacuoles. In fact, several types of vacuoles do 
exist. Some of them, besides having the function of 
maintaining the turgor pressure and of regulating ion, 
sugars and amino acid release, also constitute the 
depository of storage and defense proteins . The specific 
signal sequence for targeting to the vacuole has not been 
identified yet, apart from some plant species (Kermode, 
1996) . Probably, one or more surface regions of the 
correctly conformed protein are recognized by the 
selection mechanism. Plant cells possess the unique 
feature of accumulating storage proteins in the protein 
bodies, whereas in animals similar inclusion bodies are 
formed only when, an excess of protein synthesis occurs. 
Therefore, the latter protein bodies consist of unmuddled 
accumulation of conf ormationally incorrect or partially 
processed proteins . The formation and organization 
process of the protein bodies in plants remains unclear, 
although it is known that it consists of a series of 
ordered events (Okita, 1996) . Globulins are proteins with 
an acidic isoelectric point (pi ) , accordingly they are 
translocated in the endoplasmic reticulum (ER) and in the 
Golgi complex as soluble proteins . As soon as they reach 
the vacuoles, due to their low pH or possibly to the 
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processing and assembly, these proteins precipitate 
forming particled aggregates that will thereafter 
originate the protein bodies (Kermode, 1996) . 

In leguminosae, different storage proteins are 
accumulated in the same protein bodies with no spatial 
segregation. In other plant species the protein bodies 
form in the ER and are then absorbed in vacuoles by 
autophagocytosis (Kermode, 1996) . 

This general pattern is well-grounded for p- . 
conglycinine as well, though the specific vacuole 
targeting sequence have not been identified . for this 
protein. Instead, the p-conglycinine binding with a BIP- 
homologous protein has been- observed. This protein 
functions as chaperonine and, just like other proteins in 
different plant species, can have the role, of retaining 
p-conglycinine in the ER until its correct conformation 
is reached (Galili et al. , 1993; Shewry et al. , 1995; 
Kermode, 1996; Pontes et al., 1996). As for 7S basic 
globulin, available information is scarce . It is )cnown 
that it is located in protein bodies on the middle 
lamella of cell wall and in plasma membrane and not in 
vacuoles (Watanabe, 1994). For this reason, its mechanism, 
of division into compartments is hypothesized to be 
different from that of p-conglycinine. However, it is 
known that even wall-located proteins follow the same 
transport pathway of the vacuole proteins, i.e. are 
translated in the ER, then transported to Golgi and 
lastly secreted to the outside or inserted in the 
membrane by vesicular traffic. 

However, storage proteins expression in heterologous 
hosts shows that the compartimentalization mechanism is 
universal . In transgenic plants the seed vacuole storage 
proteins are correctly targeted. Nevertheless, sometimes 
transport can be inefficient, especially in vegetative 
organs with respect to seeds . In tobacco, iegurainose 
storage proteins are correctly targeted to vacuoles both 
in seeds and in leaves, yet in leaves there is 'a lower 
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accumulation level. This is so because a difference 
exists in the transportation efficiency or because of a 
different processing rate (different proteases or higher 
instability) . Hence, it can be understood how seed- 
specific in plant production of a heterologous human 
protein is a complex mechanism, so that the preliminary 
verifying of the functioning and efficiency of the 
expression system, as constructed in a model host 
organism, constitutes an important step . 

It has- been seen how tobacco is one of the most 
widely used plants to this end. Its preferential use in 
assays derives from the fact that it is one of the better 
known, plants, both in a genetic and in a biological and 
physiological respect . This, together with the e^se of 
effecting the genetic transformation and the shortness of 
vegetative cycle, made it become an important model for 
biotechnological experiments, a model whose 

transformation specific systems and micropropagation 
conditions are now better known. Additionally, tobacco 
possesses the further advantage of a near-complete 
extensibility of the obtained results to several other 
plant species, consequence of the high conservation rate 
of genie control mechanisms, that precisely proved to be 
usually highly conserved in other plant species, and in 
particular in leguminosae . Therefore it is particularly 
suitable for the study of the promoters taken out 
therefrom, and in particular of their capacity of 
allowing a gene of interest to be expressed in a 
controlled way in a transformed plant . Genes of interest 
are usually those encoding proteins suitable in a 
pharmaceutical or alimentary field. Accordingly, a 
heterologous gene of interest for this kind of 
application is that of the human lactoferrin, a protein 
belonging to the transferrin family, and as such capable 
to stably and reversibly bind two iron ions. 

In fact, by virtue of its biological functions 
lactoferrin turns out to be interesting from a 
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nutritional as well as from a pharmacological point of 
view . It is pr e s ent in human mi 1 k and has a fundamental 
role in neonatal feeding, as a matter of fact several 
biological functions have been attributed to this 
protein^ among which a bactericidal and bacteriostatic 
activity against a wide range of pathogenic 
microorganisms and the capacity of increasing iron 
absorption at the intestinal level (Lonnerdal and Iyer, 
1995; Hambraeus and Lonnerdal, 1993) . Moreover it 
promotes cellular growth, controls myelopoiesis and is 
capable of modulating the inflammatory response 
{Lonnerdal and Iyer, 1995; Oguchi et al . 1995; Penco et 
al. 1995). 

Therefore, at first, attempts to research in milk of 
other mammals a protein capable of binding iron and 
possessing the same properties were carried out. 

It has been observed that milk of all mammals 
contains two types of iron-binding proteins , present in 
different ratios: transferrin, identical to serum 
transferrin, and lactoferrin. Human milk has a 
particularly high lactoferrin content, in fact its 
concentration in colostrum is of 5-10 mg/ml, although it 
decreases during lactation to about 1 mg/ml in ripe milk 
(Hambraeus and Lonnerdal, 1993) . However, the amount of 
lactoferrin is much lower in milk of other animal 
species, like goat, horse, pig and mouse. In cow's milk 
for instance its concentration is of about 0 , 1 rag/ml . In 
some species such as rabbit, rat and dog, lactoferrin is 
absent, the prevailing iron-binding protein being instead 
transferrin. 

Further, lactoferrin (LF) produced by other non- 
human mammal species, assumes in each of them different 
structural characteristics, and therefore different 
properties . 

Human lactoferrin (LFU) is a 7 8 KDa monomer 
glycoprotein, with a bilobate structure . There is a high 
degree of homology between the N- terminus domain and the 
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C-terminus one, both at the amino acidic sequence (37%) 
and at the tridimensional structure level. The 
tridimensional structure has been described in detail by 
X-ray crystallography {Lonnerdal and Iyer, 1995) . The 
gene encoding LFU has been cloned and sequenced. Genie 
control mechanisms at a transcriptional and translational 
level and estrogens and iron role in those mechanisms are 
also known (Liu et al, 1991) . The mature protein consists 
of a 692 aa polypeptidic chain with a 8.8 - 9 pi. It 
contains 16 disulfide bridges and shows some resistance 
to proteolysis (Lommerdal, 1995) , has three glycanic 
polyacetylactosaminic chains bound with N-glycosidic 
bonds to the amino acidic residues Asn233, Asn476 and 
Asn545 and the molecular weight of the glycosilated 
protein is 82 KDa. 

One of the most important differences existing among 
lactoferrins (LFs) present in the various animal species 
is precisely the glycosidic chain composition. In fact, 
unlike human LF, bovine LF contains a-1,3 galattosidic 
residues and glycans of oligomannosidic type; the role of 
the glycosidic chains has not been defined yet, however 
it is possible that glycans protect LFU against attacks 
from proteolytic enzymes. Each of the two LFU domains is 
capable of binding tightly, yet reversibly to a ferric 
ion and at the same time to a carbonate or bicarbonate 
ion molecule (Hambraeus and Lonnerdal, 1993) . Iron 
binding sites in human milk lactoferrin are not 
completely saturated, but only at 6-8% of their capacity 
(Stowell et al., 1991). 

In recent years, a series of studies aimed at 
understanding the mechanisms of action and the relation 
between molecular structure and function of this protein 
have been carried out. The strategy adopted was that of 
studying LF molecules structurally altered by site- 
specific mutagenesis. Accordingly, expression systems of 
LFU recombinants producing in a simple and inexpensive 
way a protein as identical as possible to the one 
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purified from human milk were carried out. 

However, all heterologous hosts used so far for the 
LFU recombinant are eukaryotes, as, it being a complex 
glycoprotein, requires a sophisticated processing 
apparatus. 

In 1991 Stowell et al . cloned the LFU gene in 
cultured neonate hamster renal cells . An inducible Zn^'^- 
promoter and the secretion signal of a hamster endogenous 
protein were used to maximize expression. Concentration 
of LFU recombinant secreted in the culture medium was of 
about 20 rag/1, sufficient for crystallization and 
the refore for structural studies. Characterization 

Q revealed that it has the same molecular mass of native 

LFU maintaining intact the iron-binding site. It only 

J differs from human milk LFU in the glycosidic chains and 

N- terminus sequence, but these do not influence folding. 
. Such an expression system is highly expensive and not 

y suitable for the production of the amounts of proteins 

required at an industrial level. 

Then LFU was expressed in transgenic mice's milk. In 
this case the entire animal rather than the cultured 

n I; 

cells was transformed, using the regulation sequence of 
iM= the bovine gene a-Sl caseine. It was shown that LFU mRNA 

is exclusively expressed in female mammary gland during 
lactation. In milk the protein reaches a 0.1-36 \ig/l 
concentration. Recently this LFU recombinant has been 
characterized (Nuijens, 1997), and it has been observed 
that it has the same molecular mass, N-terminus sequence 
and immunoreactivity of native LFU. It also maintains the 
capacity of releasing iron at acidic pH and the bond to 
bacterial lipopolysaccharides . Also in this case, 
. glycosilation, as well as in vivo bactericidal and 
antiinflammatory action, are different from the ones in 
native LFU. 

Indeed, a highly significant production system of 
recombinant LFU is that carried out in Aspergillus 
awamori (Ward et al, 1995) . This method, which is 
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patented, yields commercial amounts of recombinant LFU (2 
g/1) . In order to maximize expression LFU is produced as 
a fusion protein with part of the structural gene, 
regulation sequence and secretion signal in the culture 
medium of the glucomylase. The fusion polypeptide is 
processed to yield mature LFU by an endogenous peptidase. 
Glucoamylase is an Aspergillus protein expressed in high 
amounts . 

An alternative eukaryotic host is the one used by 
Hitra and coworkers in 1994 . They have transformed 
tobacco cells in suspension. In the transgenic calluses a 
protein much shorter than the native LFU and therefore 
found to be unstable is produced in small amounts . 
Recombinant LFU shows activity against phytopathogenic 
bacteria, e.g., Xantomonas campestri, Pseudomonas 
syringae and others . In the above-mentioned study the 
obtaining of entire and fertile plants is not reported. 

Recently LFU was also produced in culture in insect 
cells, using Baculovirus as expression system (Salmon et 
al., 1997), This is a highly powerful expression system , 
yielding a recombinant protein identical to the native 
one, apart from the glycosilation level. Nevertheless it 
maintains the binding with the specific receptors. 

All the above reported expression systems allow to 
obtain an amount of protein sufficing for functional 
studies, and in some cases (Aspergillus) for commercial 
uses as well. In the latter case however, safe use of the 
purified protein, e.g. in milk substitutes for neonates, 
requires an excessive purification in order to ensure the 
absence of immunogenic or allergenic substances. However, 
transgenic plants have never been used to this end, nor 
products directly suitable in human nutrition have been 
ever yielded, for instance by the recombinant LFU 
expression in alimentary plants. 

Summary of the invention 

The present invention relates to a general system of 
tissue-specific, and in particular seed-specific. 
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accumulation of heterologous proteins, designed and 
carried out with the object of maximizing the production 
while preventing the degradation thereof, by using leader 
sequences and promoters of Bg and j5 conglycinine genes. 
To this end, the structural part of the selected gene may 
encode proteins having an enzymatic activity, used in 
human therapy or in industrial processes, or proteins 
with a general (lactof errin) or specific (antibodies or 
antigens ) pharmacological activity, or antibody proteins 
for phytopurification or for the elimination of 
mycotoxins present in foods. 

The present invention also relates in particular to 
a system that, enabling the in plant tissue-specific 
expression of the human lactof errin gene, provides an 
important solution to the problem of the production of 
this protein. This system in fact determines the 
production of plants capable of expressing relatively 
high amounts of this protein that, in the preferred 
embodiment providing the expression of a synthetic gene 
designed by the inventors so as to maximize its in plant 
expression, reaches industrially relevant levels. 
Moreover, such transgenic plants allow to avoid the 
costly product purification processes, as they can be 
used as nutraceuticalSr and therefore being directly 
intaken as alimentary products. Accordingly, also the use 
for the production of protein flours or of protein 
extracts yielded from tissues, and specifically from 
seeds of the afore mentioned transgenic plants, for the 
production of functional foods or special preparations 
for children is possible. 

The protein accumulation in the seed further allows 
to significantly increase the iron content of the same, 
or of the flours derived therefrom. Accordingly this 
system allows the obtaining of foods which, having a 
twofold iron content with respect to analogous product, 
can be also used for supplying this micronutrient' s 
deficiency through a normal alimentation . 
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These plants can anyhow be used also for human 
lactoferrin purification, by conventional methods based 
on chromatography techniques. 

This expression system further provides the use of 
new recombinant vectors, constructed by testing the 
effect of various leader sequences and processing sites 
of the mature protein, enabling the production of any 
protein of interest and in particular of lactoferrin or 
of fragments derived thereof, with a tissue-specific 
protein accumulation, in plants belonging to different 
families among which leguminosae, cereals, solanaceae, 
fruit-bearing plants and horticultural produce in 
general. In particular they are structured so as to have 
the following functionally linked components : (a) a 
promoter; (b) a signal sequence; <c) a nucleotide 
sequence optimized for in plant expression and 
corresponding to the amino acidic sequence of the entire 
human lactoferrin or to fragments thereof; (d) a 
polyadenylation signal . 

In particular the case concerning plasmid is 
considered, and wherein regulation elements and signal 
sequences used are those belonging to two genes encoding 
storage proteins that are very common in Soya seeds, i.e. 
a p-conglycinine and a 7S basic globulin, isolated and 
cloned from the Richland soya variety. They can be used 
to transform plant cells both by the Agrobacteriura method 
and by direct physical methods (Gelvin and Schilperoort, 
1995; DuPont Biolistic Manual, DuPont) . The vegetal 
transformed cells are hence selected with the selection 
agent provided for the purpose and induced to form entire 
fertile plants capable of forming seeds, in turn capable 
of expressing the gene for lactoferrin, and accumulating 
it as storage protein. 

This result was obtained by designing and 
synthesizing an artificial gene encoding the same amino 
acidic sequence of the natural human gene, but having 
sequence mutations such that codons most frequently used 
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by the human cell are replaced with those most frequently 
used by the vegetal cell. This result yielded the change 
of the 31% of the codons in the original gene (see table 
1) . The remarkable production of human lactoferrin 
detected in the various plants transformed with the 
synthetic gene, with yields going from 1% to 1.8% of the 
seed total storage proteins, but not in plants having the 
natural gene, proved the in plant functionality thereof. 
Furthermore on the basis of such sequence, which allows 
the efficient expression of the human lactoferrin in any 
vegetal cell, a person skilled in the art can derive the 
specific sequence expressed with a higher efficiency in 
each single vegetal species, by the simple application of 
the common general knowledge. 

Concerning all the above disclosed, object of the 
present invention is a polynucleotide encoding human 
lactoferrin, characterized in that it has a sequence 
totally or partially corresponding to the sequence 
reported as SEQ ID N0:1, or to a sequence biologically 
equivalent thereof and in that said sequence is optimized 
for in plant expression, and in particular the case 
wherein said polynucleotide has fused to the 5' -terminus 
end a sequence selected from the group comprising the 
sequences shown as SEQ ID NO: 13 and NO: 14. A particular 
case of sequence partially corresponding to the SEQ ID 
NO:l is given by a seqpaence corresponding to one or more 
fragments of said sequence. 

Ob j ect of the present invention is also the human 
lactoferrin protein, obtained from the expression of the 
afore mentioned sequences. 

A further object of the present invention is a 
recombinant DNA vector, in particular a plasmid, 
comprising at least one sequence of a gene of Interest, 
in particular the gene encoding the complete human 
lactoferrin, specifically a sequence totally or partially 
corresponding to the SEQ ID N0:1, operatively linked to 
regulation elements enabling the tissue-specific 
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expression of said gene. ■ A special case is the one where 
such regulation elements consist of an expression 
cassette for plants allowing tissue-specific expression 
of said gene. 

Further cases of interest are those wherein the 
expression cassette for plants are constituted by the 
regulation elements of the protein IS basic globulin, and 
in particular when among said regulation elements there 
is the sequence reported as SEQ ID N0:21, or the 
regulation elements of the p-conglycinine protein, and in 
particular when among said regulation elements there is 
the sequence reported as SEQ ID NO: 22, the case wherein 
the sequence of the gene encoding complete or partial 
human lactoferrin is operatively linked, or even fused, 
to a leader sequence, and the case wherein such leader 
sequence is selected from the group comprising sequences 
SEQ ID NO: 13 and NO: 14. 

Of particular importance is the case wherein such 
plasmid is selected in the group comprising vectors pBI, 
pGEM or pUC. 

A further ob j ect of the present invention is 
constituted by the transformation process of vegetal 
cells wherein the transformation is effected with one of 
the above-mentioned vectors, the transgenic vegetal cells 
can be obtained through transformation of wild type 
vegetal cells with at least one of the same vectors, and 
by the cells that anyhow contain a gene of interest, or 
portions thereof, and in particular the gene encoding 
human lactoferrin operatively linked in an expression 
cassette, enabling the tissue specific expression of the 
gene itself, in particular that of the gene encoding 7S 
basic globulin and that of the gene encoding 
conglycinine . 

A further object of the present invention is 
constituted by cellular aggregates and in particular 
calluses characterized in that they are obtainable by the 
aforesaid cells. 
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A further object of the present invention are also 
the transgenic plants obtainable from the aforesaid ceils 
with conventional techniques, or anyhow containing a gene 
of interest and in particular the gene encoding human 
lactof errin, specifically the one with a sequence 
corresponding totally or partially to SEQ ID N0:1, 
operatively linked in an expression cassette enabling the 
tissue specific expression of the gene itself . 

Of particular relevance is the case wherein such 
transgenic plants are selected from the group comprising 
solanaceae, cereals, leguminosae, horticultural produce 
and fruit-bearing plants in general, in particular Soya, 
tobacco and rice, wherein the gene encoding lactoferrin 
is specifically expressed in the storage tissues or in 
the fruit . A further object of the present invention is 
the use of such transgenic plants as nutriceutlcals . 

A further object of the- present invention is 
constituted by the production processes of functional 
foods containing proteins produced by the aforesaid 
transgenic plants, of vegetal milks, starting from 
natural and/or concentrated proteins deriving from the 
above-mentioned plants, and anyhow any human lactoferrin 
production process, characterized in that it utilizes the 
aforesaid plants. 

Lastly, object of the present invention is also the 
human lactoferrin obtained from the aforesaid transgenic 
plants. 

The invention will be better described with the aid 
of the annexed figures . 

Description of the figures 

Figure 1 shows the strategy adopted for the assembly 
of the synthetic gene encoding human lactoferrin. 

Figure 2 shows the map of plasmids pGEM-PGLOB (A) and 
pGEM-PCONG (B) obtained from the cloning of the Soya 
promoters in plasmid pGEM (Promega) , in which the 
restriction sites used to derive the plasmids are 
highlighted. 
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Figure 3 shows agarose gel electrophoresis analysis 
of digestion of plasmids pGEM-PGLOB with Sal I (lanes 1, 
2, 3 and 4) and pGEM-PCONG CON Sph I (lanes 6, 7, 8 and 
9) , carried out to test clockwise orientation of the 
insert. All PGLOB samples tested positive, yielding 
fragments of the expected sizes. PGLOB 1 is sample 1 
selected for the subsequent molecular work. In contrast, 
PC0t3G samples did not yield the expected pattern, 
suggesting the possibility of the presence of errors due 
to the adopted cloning technique (hypothesis later 
discarded, see figure 6) or due to isolation of a variant 
of the control region that in the Richland variety 
differs from the one of the disclosed sequence (Dare 
variety). Sample 5 is lambda marker Hindlll. 

Figure 4 shows agarose gel electrophoretic analysis 
of . the restriction pattern of two plasmxd pGEM-PCONG 
clones with enzymes Nde I (lanes 1 and 2) , Rsa I (lanes 3 
and 4), and SnaB I (lanes 9 and 10), carried out in order 
to test orientation and identity of the constructs. Cuts 
with Nde I , Apa I and SnaB I yielded the expected 
patterns in contrast to the cut performed with Rsa I; 
this results are justified from the differences found in 
sequence and reported in figure 6 . In lanes 5 and 6 
markers of molecular weight A--DNA Hind III and Marker IV 
(Boehringer) respectively are present. 

Figure 5 shows electrophoretic analysis of the 
restriction of various clones of plasmid pGEM-PCONG with 
Rsa I (lanes 1-6) and Hinf I (lanes 9-14) enzymes in 
order to test identity of constructs . In both cases the 
obtained pattern do not mirror the expected ones, but are 
conserved among the different clones, thereby suggesting 
their being due to differences in the original sequence 
with respect to the published restriction map and not to 
errors in the amplification phase with Taq polymerase or 
in the cloning. Adopted markers are X-DNA Hind III and 
Marker IV. 

Figure 6 shows the comparison between the published 
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sequence of gene CONG promoter region and the one cloned 
in plasmid pGEM-PCONG. 

Figure 7 shows a schematic view of the two plasmids 
resulting from the cloning of the native gene LFU into 
the two vectors pGEM-T and pBI121, carried out in order 
to obtain plasmids used later on as transformation 
control . 

Figure 8 shows the map of the two plasmids resulting 
from synthetic LFU gene cloning into vectors pGEM-PGLOB 
and pGEM-PCONG, i.e. plasmids pGEM-PGLOB-LFU (A) and 
pGEM-PCONG-LFU (B) , respectively. 

Figure 9 shows the map of plasmids pBI-PGLOB-LFU (A) 
and pBI-PCONG-LFU (B) wherein the restriction sites used 
are highlighted. In particular, box (A) shows the 
construction of a plasmid containing the synthetic gene 
represented in the sequences list as SEQ ID N0:1 and 
cloned in plasmid pBIlOl fused to promoter PGLOB and in 
Open Reading Frame with the "leader" of 7S basic 
globulin. 

In contrast, box (B) shows the construction of a 
plasmid containing the synthetic gene reported in the 
sequence list as SEQ ID NO: 1 and cloned in plasmid 
pBIlOl fused to promoter PCONG and in open reading frame 
with the p-conglycinine leader. 

Figure 10 shows electrophoretic analysis of the 
restriction of various clones of plasmid pBI-PCONG-LFU 
with enzymes Xba I, BamH I, Sac I; samples 4 and 5 test 
positive . In position 6 molecular weight marker Ladder 
1Kb is present. 

Figure 11 shows electrophoretic analysis of the restriction 
of two clones of plasmid pBI-PCONG-LFU with enzymes Sal I 
{lanes 1 and 2) and with Xba I and Sac I {lanes 4 and 5) ; 
Both samples tested positive. Samples 3 and 6 represent 
the positive controls, i.e. pGEM-PCONG-LFU digested with 
Sal Ir Xba I and Sac I respectively. 

Figure 12 shows agarose gel electrophoresis analysis 
of PGR products from genomic DNA extracted from various 
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plants transformed with pBI-PGLOB-LFU, performed using 
primers PLT48 and PLT49 for the promoter sequence PGLOB. 
Positive samples 2, 3, 4 and 5 represent the band of the 
DMA amplified to 1500 base pairs, while samples 6, 7 and 
8 represent the negative control of PGR and the positive 
control (pGEM-PGLOB) respectively. Molecular weight 
markers Ladder 1Kb are found at 1 and 9. 

Figure 13 shows in box (A) the agarose gel with the 
genomic DNA of tobacco transformed with pBI-PCONG-LFU 
{lanes 1-5) or with pBI-PGLOB-LFU (lanes 6-9) cut by 
enzyme BamH I . M is the molecular weight marker Ladder 1 
Kb. Sample 10 shows the positive control pGEM-LFU, not 
shown in the photo for quantitative reasons. In box (B) 
the hybridization pattern of human LF on the genomic DNA 
of the same tobacco plants is shown; samples 1, 2 and 3, 
belonging to plants PCONG 1, PCONG 3, and PCONG 4 
respectively are positive, as is the case for samples 5, 
7 and 8, belonging to plants PGLOB 10, PGLOB 3 and PGLOB 
4 respectively. It is evident that pGEM-LFU, the positive 
control (lane 10) , was only partially digested as also 
the super-coiled plasmid forms are present. 

Figure 14 shows SDS-PAGE electxophoretic analysis of 
proteins partially purified from seeds of the transgenic 
plants tested with Southern analysis of- the preceding 
figure (A) and ^Western analysis .of the same proteins, 
after transfer to a membrane, using polyclonal antibodies 
specific for the human lactoferrin (B) . In particular, in 
box (A) SDS-PAGE electrophoretic analysis of total 
cellular proteins (30 DAP) from mature seeds of 
transgenic tobacco is shown. In position 2, 3, 4 and 5, 
6, 7 the same samples tested positive to Southern 
analysis, extracted by buffer at pH 2.7 and pH 7.6 
respectively, are found. Samples 8 and 10 represent the 
positive control (milk-extracted human lactoferrin, 
Sigma) and the negative control (non-transformed plant of 
the same variety) , while in position 9 the molecular 
weight marker Rainbow (Amersham) is found. In box (B) 
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autoradiography of anti-lactof errin antibody 

hybridization with the same proteins transferred to DEAE- 
nitrocellulose membrane is shown; the sample 
corresponding to plant PGLOB 10 , in position 2 and 5, 
does not yield a positive signal, although according to 
Southern analysis it is transformed. All other samples 

. are positive. 

Figure 15 shows electrophoretic analysis ' of raw 
proteins extracted from seeds and leaves of the 
transgenic tobacco transformed with the plasmids of which 
at figure 9. In particular, in box (A) protein coloration 
carried out with Coomassie blue is reported. In box (B) 
Western Blotting carried out with human LFU-specific 
antibodies on the proteins of the gel shown in box (A) 
after transfer to membrane is reported. In particular, in 
lane 1 plant PGLOB 1, with leaf -extracted proteins is 
reported, in lane 2 always plant PGLOB 1, with seed- 
extracted proteins is reported, in lane 3 plant PGLOB 3, 
leaf proteins, is reported, in lane 4 plant PGLOB 3, seed 
proteins, is reported, in lane 5 plant PCONG 105, leaf 
proteins, is reported, in lane 6 plant PCONG 105, seed 

. proteins is reported, in lane 7 plant PCONG 105, seed 
proteins treated with N-deglycosilase F, is reported {see 
text) , in lane 8 the molecular weight marker is reported, 
in lane 9 the human LFU present -on the market treated 
with N-deglycosilase F is reported, in lane 10 LFU 
present on the market is reported. 

Figure 16 shows Western analysis of LFU protein 
extracted from human milk and of recombinant protein 
isolated from tobacco seed, before and after N- 
deglycosilase F enzyme treatment . Analysis was performed 
with human lactof errin specific antibodies . In lane 1 LFU 
extracted and purified by HPLC from seeds of plant PCONG 
105 is reported; in lane 2 a protein as in 1 after a 18- 
hours treatment with N-deglycosilase F is reported; in 
lane 3 commercial LFU after a 18-hours treatment with N- 
deglycosilase F is reported; in lane 4 commercial LFU is 



wo 00/04146 PCT/IT99/00226 - 

29 

reported^ a diminution is apparent in the molecular 
weight of the two enzyme-treated samples (2 and 3) . 

Detailed description of the invention 

The strategy adopted for the generation of 
transgenic plants capable of producing human lactof errin 
was developed along two directions: on the one hand, 
comparative analyses on plant expression systems, 
particularly tobacco and Soya, have been carried out, in 
order to have a basis for the designing of a sequence 
encoding hiiman lactoferrin thereof, sequence optimized to 
maximize its expression in vegetals. Accordingly in the 
sequence designing the necessity that the required post- 
translational modifications for the production in the 
active form could be effected on the translated protein, 
and that, both for its conformation and due to its 
subcellular localization, the protein be sufficiently 
stable to. be accumulated in relevant amounts in the 
transformed plants, was taken into account. This proved 
crucial, having ascertained after various attempts 
carried out in the past years the impossibility of an in 
plant production of human lactoferrin using constitutive 
expression systems (e.g. promoter 35S) as well as 
promoters inducible by leaves cut. Moreover, besides 
difficulties related to the type of promoter used, the 
production level_ and the stability- of the protein were 
tested to be scarce and depending on a warped 
preferential use of the codons between the human gene and 
the plants. 

Therefore, a plasmid vector system was developed 
utilizing vectors containing a newly synthesized 
lactoferrin gene regulated by tissue- and stage-specific 
promoters capable of yielding a high gene expression and 
of accumulating the protein in a stable and efficient way 
inside seed storage organs . Moreover the selection of 
leader sequences and the design of the fusion point 
between those and the structural portion of the mature 
protein yielded a lactoferrin protein that, in 
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quantitative and possibly also in qualitative terms, has 
the same glycosilation level and the same amino terminal 
sequence of the native protein, which is important for 
some of its functional characteristics . 

Concerning the synthetic gene design, all the 
necessary and possible triplets were modified taking into 
account their preferential use in the two reference 
plants, tobacco and Soya. In particular, data represented 
in table 1 were used- 
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In carrying out such operation, the value G+C and 
A+T of the two systems (human and vegetal) , the non- 
tandem repeat of some triplets that may cause shifts in 
reading, etc. were also taken into due account. 

Synthetic LFU gene was then obtained using primers 
reported in the annexed sequence listing from SEQ ID NO; 8 
to SEQ ID NO: 12 and from sequence SEQ ID NO: 15 to SEQ ID 
NO: 20 and following the assembling strategy reported in 
figure 1, consisting in repeated PGR cycles, using for 
each cycle different pairs of synthetic primers allowing 
the gradual elongation and the foinning of the final 
sequence as designed. Similarly on the basis of such 
final sequence the sequences contained the codons 
preferably expressed in various species of interest, 
among which rice, have been obtained. 

In parallel, also native LFU gene (wild type) 
encoding human lactoferrin was cloned, always by PGR 
technique, starting from a cDNA library of human 
mammalian tissue (Glontech) . The gene was recovered in 
its structural part lacking the signal peptide and the 
poly-A site and cloned in pGEM-T to form plasmid pGEM-LFU 
whose map is represented in figure 7. Primers designed 
for amplification are reported in the annexed sequence 
listing at sequences SEQ ID NO: 2 and SEQ ID NO: 3; those 
added the restriction site BamHI at 5' and the 
restriction site Sad at 3' . After checking the sequence, 
which tested identical to the published one, the yielded 
natural gene- was cloned in vector pBI121, 6n sites BaraHI 
and Sad under control of promoter 35S (see figure 7) , 
this plasmid (pBI-LFU) was then denominated pBI-35S-LFU 
and used as control in the genetic transformation 
experiments and in the subsequent molecular and 
biochemical analysis . 

Concerning the preparation of the recombinant 
vectors containing the elements that allow the tissue- 
specific expression of the LFU gene in expression 
cassettes for plants, we proceeded as follows / In order 
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to obtain the seed-specific expression of the protein the 
promoters and signal sequences of two genes encoding 
storage proteins that are very abundant in Soya seeds, 
i.e. a p-conglycinine (CONG) and a IS hasic globulin 
(GLOB) were used. 

These regulation sequences were isolated and cloned 
from Soya, Richland variety. 

In particular, to clone the two GLOB and CONG 
sequences PGR technique (PGR == Polimerase Chain Reaction; 
Innis et al . 1990) was used. In this case genomic DNA 
extracted from Soya leaves of Richland cultivar was used. 
Oligonucleotides used for specific amplification are 
reported in the annexed sequence listing from SEQ ID NO : 
4 to SEQ ID NO: 7. 

For the GLOB promoter the cloned region includes the 
entire regulation sequence and the sequence encoding the 
signal peptide (leader) plus the first codon of the 
structural sequence, such sequence is indicated with SEQ 
ID NO: 13 in the annexed sequence listing. For the CONG 
promoter the cloned region includes the entire regulation 
sequence and the sequence encoding the signal peptide 
plus the first codon of the structural sequence, such 
sequence is indicated with SEQ ID NO: 14 in the annexed 
sequence listing. For both regulation sequences the most 
suitable restriction site for insert-ion proved to be Xbal 
(TCTAGA) , while downstream proved to be BamHI (GGATCC) , 
both absent from the native and synthetic lactoferrin 
sequence, as highlighted in the following tables 2 and 3. 
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TABLE 2 (continues) 
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TABLE 3 (continues) 
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DNA template was extracted from Glycine max leaves, 
Richland variety, and amplification products match sizes 
expected for GLOB (1515 pb) and CONG (1163) promoter on 
the basis of EMBL sequence data. 

Therefore, starting from fragments amplified by 
ligation in vector pGEM-T, the two vectors pGEM-PGLGB and 
pGEM-PCONG, whose map is reported in figure 2, were 
constructed. Yielded plasmids were tested by restriction 
analysis performed with several enzymes chosen among 
those cleaving in a limited number and with an overall 
sequence distribution (see figures 3, 4 and 5) and a 
clone for each type was selected and sequenced. Sequenced 
clones showed to be significantly different from the 
expected sequence. As an example, a comparison between 
the data bank promoter CONG sequence and the one obtained 
sequencing clone pGEM-pCONG is reported in figure 6. A 5% 
difference was detected, therefore the two promoters can 
be considered as different. 

The synthetic gene for human lactoferrin was cloned 
at first in plasmids pGEM-PGLOB and pGEM-PCONG, cut with 
enzymes BamHI-SacI, to form plasmids pGEM-PGLOB— LFU and 
pGEM-PCONG— LFU respectively, whose map is disclosed in 
figure 8, and then the construct Xbal-SacI transferred in 
vector pBIlOl cut with the same enzymes. In the event of 
a plant transformation carried out with physical means, 
as for rice in our case, plasmids pGEM-PGLOB-LFU and 
pGEM-PCONG-LFU, can be directly used after addition of a 
terminator, in cotransformation with a vector containing 
the selection marker (e.g., a PUC-type vector containing 
the gene for hygromycin resistance) . Resulting plasmids 
pBI-PGLOB-LFU and pBI-PCONG-LFU, whose map is reported in 
figure 9, were used in the genetic transformation of the 
plants after an accurate control carried out by 
restriction with various enzymes to assay the correct 
integration of the DNA construct (figures 10 and 
11) - Plasmids pBI-35S-LFU, pBI-PGLOB-LFU and pBI-PCONG-LFU 
were transferred in A. tumefaciens EHA105 strain cells. 
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made competent by electrophoresis. Strains containing the 
three plasmids were used to transform about 450 leaf 
disks (LD) of tobacco. Petit Avana variety. Formation 
first of shoots and then of roots was induced from 
calluses formed on leaf disks (LD) in presence of 
kanamycin. Once rooted, plants were potted and at least 
50 kanamycin-resistant plants were analyzed for each 
construct. The same plasmids containing, this time, the 
hygromicyn-resistance genes marker, are used also for the 
transformation of rice. In this case in particular 40 
plants hygromicyn resistant were analyzed. 

Plants To, of rice as well as of tobacco, were tested 
by PGR technique (figure 12), assaying the presence of 
the lactoferrin gene inside the genome of the tested 
plants; plants Ti were assayed by Southern analysis 
(figure 13), that compared to PGR technique allows a more 
accurate testing of the transgene presence in the genome, 
and with Western analysis (figure 14) allowing detection 
of genie product and therefore the functionality of 
inserted gene. 

All plants with native LFU gene under control of 
promoter 35S led to accumulation of a protein, recognized 
by LFU-specific antibodies, of a molecular weight lower 
than 50 KDa. This protein was found in small amounts in 
young leaves, becoming undetectable in the fully 
developed leaves. Plants of rice as well as of tobacco 
transformed with the two constructs pBI-PGLOB-LFU and 
pBI-PCONG-LFU produce and accumulate exclusively in seed 
a protein having a molecular weight of 82 KDa 
corresponding to the glycosilated human protein as shown 
by electrophoretic analysis of extracted proteins and by 
the related Western Blotting carried out with LFU- 
specif ic antibodies {see figure 15) . Presence of 
recombinant protein exclusively in the seed and not in 
the leaves was assayed in all the examined transgenic 
plants (about 50 for the two constructs) with Western 
techniques . 
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Recombinant LFU protein isolated from seed and 
purified with HPLC technique showed to be identical to 
the native protein concerning its iron binding capacity 
and its inhibiting effect towards the examined bacterial 
strains. Treatment with a deglycosilating enzyme confirms 
the presence of posttranslational modifications in all 
alike, at present at least in quantitative terms, to 
those present in native lactoferrin as highlighted by 
Western analysis, the results of which are disclosed in 
■ figure 16. 

Moreover, the contribution of iron consequence of 
the introduction of the lactoferrin gene in rice plants 
was assessed. In particular, in the following table 4 
results in terms of iron content of some transgenic lines 
of the Ariete and Rosa Marchetti varieties transfomed 
with plasmids pBI-PCONG-LFU and pBI-PGLOB-LFU are 
reported. Iron content was measured by atomic absorption 
after flour mineralization with HNO3 e H2O2. 

Table 4 : Analysis of " iron content in Ariete (A) and Rosa 
Marchetti (RM) varieties and of the respectuive transgenic lines 
capable of in-seed accumulation of protein lactoferrin. 
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A cl. 2-3 
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0.7 


A cl. 5-1 
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1.2 
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3.2 
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RM cl. 5-3 
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RM cL 2-5 
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RM cl. 3-6 
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3.0 


RM cl. 4-1 


48.2 


1.7 



It is therefore evident from all of the above- 
reported results that using the native gene, described in 
the literature, for human lactoferrin under control of 
the traditional promoters used for genetic transformation 
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of plants, human protein lactof errin cannot be produced 
in relevant amounts, in a stable form and with the 
posttranslational modifications typical of this protein. 

So far a general description has been given of the 
present invention. With the aid of the following 
examples, a more detailed description will now be given 
of specific embodiments thereof, with the purpose of 
giving a clearer understanding of objects, features, 
advantages and methods of application of the invention. 

EXAMPLE 1: 

Agrobacterium Tumef aciens-mediated tobacco transformation 
Day 1 

A small amount of Agrobacterium tumefaciens of 
strain EHA 105, taken from a petri plate culture with a 
sterile loop so as not to exceed in the amount thereby 
avoiding subsequent problems in controlling bacterial 
proliferation on plated leaf disks, was inoculated in 2 
ml of sterile LB. Then, from a healthy tobacco plant of 
Petit Avana variety a leaf showing no alteration 
whatsoever, conversely showing optimal turgor condtions, 
was taken. The leaf was briefly washed in bidistilled 
water to remove surface impurities, immersed for 8 min in 
a 20% sodium hypochlorite and 0.1 % SDS solution and left 
to dry under a vertical flow hood. From then on all steps 
were carried out -under hood. In particular, the leaf was 
immersed in 95 % ethanol and shaken in order to 
completely wet the pages thereof (letting the petiole 
emerge) for 30 - 40 sec. The leaf was then allowed to dry 
out completely. 

Disks were obtained from the entire leaf surface 
with an ethanol-sterilized punch, let fall on plates with 
MSIO free of antibiotics; in particular, the ratio of 30 
disks per plate was not exceeded. 

Next, 2 ml LB + (just inoculated) Agrobacterium were 
poured on plate, and the bacterial suspension was evenly 
spread over the entire plate with a gentle rotatory 
movement, in order to obtain an homogeneous bacterial 
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distribution among the disks. LB in excess was carefully 
aspirated with a pipette. In the course of those steps at 
all times a parallel negative control was provided by- 
means of a plate to which nothing, or only LB was added. 

Then plates were incubated at 28 °C for 24-48 hours 
in constant lighting conditions, and bacterial growth was 
indicated by the appearance of a thin opaque layer 
spreading over the entire plate. 

Day 2 

Leaf disks (=LD) were carefully transferred on a 
plate with MSIO + 500 mg/1 cefotaxime, and incubated at 
23 "^C for 6 days in constant lighting conditions. This 
step determines the Agrobacterium inactivation. 

Day 8 

LD were then carefully transferred on .a plate with 
MSIO + 500 mg/i cefotaxime and 200 mg/1 Kanamycin, and 
incubated at 28 °C for 14 days in constant lighting 
conditions . This step determined a selection of the 
transformed plants: in fact, gene of kanamycin resistance 
was carried by the plasmid inserted in Agrobacterium. 

Day 22 

LD, that in the meantime had grown developing a 
callus, were carefully transferred on a plate with MSIO + 
500 mg/1 cefotaxime, 200 mg/1 Kanamycin and 500 mg/1 
carbenicillin, and incubated for ' 6 days. This step 
determines elimination of the agrobacteria possibly 
survived to the previous antibiotic treatments (a very 
frequent occurrence) . 

Day 28 

LD were transferred again on MSIO + 500 mg/1 
cefotaxime and 200 mg/1 Kanamycin, and incubated until 
shooting. When shoots showed at least two leaves, they 
were separated from the callous mass and transferred on 
the radication medium: MSO + 500 mg/1 cefotaxime and 200 
mg / 1 Kan amy c i n . 

At the appearance of roots, seedlings were extracted 
from the plate, freed from agar residues, gently washed 
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in running water and planted out in loam and sand (2:1) 
inside small plastic pots . Soil was previously saturated 
with water, then pots were covered with transparent 
plastic lids to preserve high humidity conditions, and 
placed in a growth chamber at room temperature, with a 
daily 16-hour lighting period. 
EXAMPLE 2: 

Rice transformation by physical methods 
Rice seeds of Ariete and Rosa Marchetti varieties 
were harvested at milky- ripening, when the endosperm is 
still liquid. The embryo was isolated with a lancet after 
removal of the two glumes . Immature embryos are of 
different size and shape, depending on the number of days 

- elapsed from blooming: The ones deemed most suitable for 
the in vitro culture and the successive transformation, 
i.e. those of an 1.5 mm average size, were cultured on a 
medium containing 2, 4-D auxine to promote scutellura cell 
division and suppress differentiation of young embryos, 
obtaining cellular proliferation of the scutellum area. 
At this stage embryos underwent bioholistic 
transformation, with the following parameters: particle 
size 1 - 5-3 ^m, particle concentration 500 ]ig, membrane 
rupturing pressure 1.100 psi, membrane-microcarrier gap 6 
mm, microcarrier stop-point gap 6 mm, stop-point target 
gap 10.5 cm. 

Transformation was effected by a cotransformation 
technique, using as a selectable marker the gene for 
hygromycin on plasmid pR0B5. Cotransformation was 

- effected with a total DNA concentration of 1 yig/ul using 
0.6 pg DNA for bombing and with a selectable 
plasmid/suitable plasmid (pROBS with pBI-PGLOB-LTU or 
with pBI-CONG-LTU) ratio of 1:1 assessed on the number of 
molecules (abt. 1:4 in amount). 

Osmotic conditions of the plant material were 
optimized carrying out a preculture on 3% saccharose and 
plasmolysis prior to bombing on MS with 10% saccharose 
f or - 1 hour. 24 hours after bombing with PDS-lOOO/He 
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bioholistic sysrem the material was transferred on 3% 
saccharose medium. 

For selecting the transformational events , bombed 
tissue underwent selection in presence of hygromycin 3 
(Duchefa) . 1 day after bombing embryos were transferred 
on solid MS medium, additioned with 2 mg/1 2,4-D auxine, 
50 mg/1 cefotaxime, 50 mg/1 hygromycin B, 3% saccharose 
and 0.35% agarose. One week later embryos were 
transferred on R2 liquid medium (Ohira et al. , 1973) 
containing 1 mg/1 thiamine, 50 mg/1 cefotaxime, 50 mg/1 
hygromycin B and 3% saccharose, pH 5.8, in 190 ml plastic 
vessels (Greiner) . Liquid cultures were shaken at 90 rpm 
and 28 °C in the dark, replacing the liquid medium every 7 
days , After 3-4 weeks resistant calluses formed on the 
embryo surfaces are transferred on R2 solid medium 
additioned with MS vitamins, 2 mg/1 2,4-D auxine, 50 mg/1 
cefotaxime, 50 mg/1 hygromycin B, 60 g/1 saccharose and 
0.5% agarose and maintained for 2-4 weeks, until 
formation of embryogenic structures. 

At the formation of embryogenic structures calluses 
were transferred on solid MS regeneration medium 
containing 2 mg/1 BAP, 0.2 mg/1 NAA, 3% maltose, 50 mg/1 
hygromycin B, 50 mg/1 cefotaxime and 0.8% agarose. 

Embryogenic calluses were maintained in phytotron at 
28''C with 16 hours of lighting to ' induce formation of 
shoots, that, once formed, were transferred on hormone- 
free 1/2 MS radication medium, 3% saccharose and 0 .3% 
gelrite. After 3-4 weeks at 28 "C, seedlings were 
transferred in a Yoshida solution (Yoshida et al. 1976) 
and maintained at 25/19 °C day /night with a 11 h day 
length , After 4 weeks plants were potted and grown in 
hothouse until the cycle end. 

EXAMPLE 3: 

Purification of lactoferrin protein from dif ferent 
tissues of the plant and assessment of molecular weight . 

Extraction of all the proteins of tobacco seed was 
performed grinding the seeds in liquid nitrogen in 
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presence of an extraction buffer (0.5 M saccharose, 0.1% 
ascorbic acid, 0.1% Cys-HCl, 0,01 M Tris-HCl, 0.05M EDTA 
pH 8) . 

Then the solution was centrifuged for 30 minutes at 
14.000 rpm at 4^0 and the supernatant was kept with the 
soluble proteins. 

Then the solution was filtered with filters of 0.2 
^tm porosity, and the iactoferrin partially purified by 
removing proteins of a molecular weight lower than 30 KDa 
by centrif ugation in Centricon 30 column (Amicon) . 

The Iactoferrin was further purified by HPLC 
chromatography on Resource Q column (Pharmacia) at a weak 
cationic exchange, with elution in phosphate buffer pH 7 
and NaCi gradient 20-100%. The peak corresponding to 
Iactoferrin eluted at 0.7 M NaCl. 

The fractions of the elution range were reunited and 
filtered in Centricon 30 to remove salt. 

For the Iactoferrin extraction from tobacco leaves , 
up to the centrifugation step we proceeded as in the case 
of extraction from seed, then the supernatant was 
additioned with 60% (NH4)2S04 and left shaking In ice for 
60 min. 

Then the solution was centrifuged at 14.000 rpm for 
15 minutes at 4°C , the pellet recovered and then 
suspended again in" phosphate buffer pH 6.8. 

For the assessment of molecular weight in SDS-PAGE, 
the colorant (SDS loading buffer) was additioned to the 
Iactoferrin sample ^20^1) and the samples were loaded 
onto 8% polyacrilamide minigels. Running conditions were: 
initially 10mA, and 20 mA for the entire run, in Tris- 
glycine Ix buffer. Then the gel was stained by Silver 
staining technique and the molecular weight assessed 
referring to molecular weight standards. 

EXAMPLE 4: 

Western analysis of the Iactoferrin protein produced 
in plant and deglycosilation thereof. 

Lactoferrxn purified from seed according to example 
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2, after electrophoretic separation on acrylamide gel was 
transferred by electroblotting (buffer 25mM Tris, 192 mM 
glycine, 20% methanol, 45 V at 4°C) to a nitrocellulose 
membrane (BASS Schleicher and Schull) . 

The membrane with the immobilized protein was shaken 
for 60 min in TBS-T 5% Skin milk solution and then, after 
some washings, with the same solution containing the 
primary antibody in a 1:2500 ratio. 

After reaction with primary antibody the membrane 
was washed and placed-- in contact with the secondary 
antibody (Anti-Rabbit peroxidase conjugate) , always in 
TBS-T Skin milk solution, in a 1:12.000 ratio. 

After reaction with secondary antibody the membrane 
was washed several times and placed in contact with 
Araersham' s chemiluminescence kit ECL. 

The membrane was then exposed in contact with a 
photoplate (Hyper film MP, Amersham) in " darkroom for 
variable lengths of time . 

Deglycosilation with N-glycosidase F enzyme 
(Boehringer Man. ) was carried out using 10 y.1 in volume 
of glycopeptide (10 p,g) denatured in 0.1% SDS brought to 
boiling point for 2 rtixn. 

To this solution 90 ^l of buffer (20 mM phosphate 
buffer pH 7.2, 50 mM EDTA pH 8, 10 mM sodium azide, 0.5& 
NP40, 1% p-mercaptoethanol) were additioned and it was 
brought to boiling point again for 2 min, then cooled at 
37''C, 

To the resulting 100 }zl 1 U of N glycosidase F was 
additioned and let incubate at 37^0 for 18 hours. Then 
the reaction product was analyzed on SDS-PAGE gel and the 
lactoferrin protein detected by Western technique. 

GLOSSARY 

The term "recombinant polynucleotide", as it is used 
here to characterize a polynucleotide useful in the 
production of lactoferrin, relates to a polynucleotide of 
genomic origin, cDNA, semi-synthetic or synthetic, that, 
by virtue of its origin or manipulation: 1) is not 
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associated to a portion or to the totality of the 
polynucleotide to which it is associated in nature, and/or 
2) is linked to a polynucleotide differing from that to 
which it is associated in nature, or that 3) does not 
exist in nature. 

The term "polynucleotide", as it is used here, 
relates to a polymeric form of nucleotides of any length, 
ribonucleotides as well as deoxyribonucleotides . This term 
exclusively refers to the molecule primary structure. 
Hence, the term includes single and double stranded DNA as 
well as single and double stranded RNA. It also includes 
modified forms of the polynucleotide, e.g. by methylation, 
phosphorilation or "capping", and non modified forms. 

An «expression cassette for plants» relates to a 
recombinant polynucleotidic sequence obtained by linking 
together operatively various elements constituted by the 
polynucleotidic sequences that determine the in plant 
expression of a character and that are easily transferable 
as discrete constructs, from a vector to another by 
enzymatic restriction. 

A "vector" is a replicon to which another 
polynucleotidic fragment is added, in order to effect the 
replication and/or expression of the fragment itself. 

A "replicon" is any genetic element, for instance a 
plasmid, a chromosome, a virus, ' that behaves as an 
autonomous polynucleotidic replication unit inside a cell; 
therefore it can replicate autonomously. 

"regulation sequence'* refers to polynucleotidic 
sequences that are needed to effect the expression and/or 
the secretion of coding sequences to which they are bound. 
The nature of these regulation sequences differs depending 
on the host; in prokaryotes those regulation sequences 
usually include promoter, binding site of " ribosomes and 
terminators; in eukaryotes these regulation sequences 
usually include promoters, terminators and, in some cases, 
enhancers. In addition, in prokaryotes as well as in 
eukaryotes, leader sequences control the host cell 
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secretion of the expressed polypeptide . The term 
"regulation sequences" includes, at leasts ail components 
whose presence is required for expression, and may also 
include additional components whose presence is 
advantageous, for instance leader sequences . 

A «leader» sequence is a polynucleotidic fragment, 
usually short, encoding a transport signal of the protein 
fused thereto and leading the protein transfer into 
specific cellular compartments . If the transfer takes 
place through the endoplasmic reticulum the protein 
undergoes specific posttranscriptional modifications, 

"Operatively linked" relates to a juxtaposition 
wherein the above described components are in a relation 
enabling them to function in the expected way. A 
regulation sequence «operatively linked» to a coding 
sequence is linked in such a way that the coding sequence 
expression takes place in conditions that are compatible 
with the regulation sequences. 

AN open reading frame ^ ORF is a polynucleotidic 
sequence region encoding a polypeptide; this region can 
represent a portion of coding sequence or a complete 
coding secpience. 

A "coding sequence" is a polynucleotidic sequence 
that is transcripted in the mRNA and/or translated in the 
polypeptide when placed under control of appropriate 
regulation sequences . The ends of the coding sequence are 
determined by a translation start codon at 5' and by a 
translation stop codon at 3' . A coding sequence can 
include, without being limited to, mRNA, cDNA, and 
recombinant polynucleotidic sequences. 

"Recombinant host cells", "host cells", "cells", 
"cell 1 ines " , " cell cultures " and other terms indicating 
microorganisms or cell lines of superior eukaryotes , 
cultivated as unicellular entities, are used here in an 
interchangeable way. They relate to cells that can be, or 
have been, used as hosts for recombinant vectors or other 
transfer polynucleotides, including the progeny of the 
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cell that was originally transformed. It is implicit that, 
due to random or deliberate mutations, the progeny of a 
single parental cell need not necessarily be identical to 
the parental cell from a morphological and a genetic point 
of view. Progenies of the parental cell that are 
sufficiently similar to the ancestor cell and can be 
characterized for their salient capacity, as e.g., the 
presence of a nucleotidic sequence encoding the peptide of 
interest, are included in the progeny understood according 
to this definition and fall within the same terms. 

For «cell aggregation* a group of cells that are not 
structured in an organized tissue, but result from an 
undifferentiated proliferation of cells maintained in 
particular conditions of hormonal concentration. 

"Transformation" , as it is used here, refers to the 
insertion of a exogenous polynucleotide in a host cell, 
regardless of the method used for the insertion itself, 
e.g. direct acquisition, Agrobacterium infection, sexual 
reproduction. The exogenous polynucleotide can be 
maintained as a non integrated vector, for example a 
plasmid or, alternatively, it can integrate in the host 
genome . 

As it is used here, the term "polypeptide" relates to 
the amino acidic product of a sequence encoded inside a 
genome and does not relate to the specific length of the 
product: accordingly, peptides, oligopeptides and proteins 
are included in the definition «poiypeptide» This term 
does not relate to the post-expressional modifications of 
the peptide, as e.g. glycosilation, acetylation, 
phosphorilation, sialilation and the like. 

A "wild type polypeptide", has an amino acidic 
sequence identical to the one encoded in the genome of the 
organism source of the coding sequence. 

"Native lactof errin " and analogous terms relate to 
the lactoferrin isolated from the source in which it is 
usually produced in nature by a genome existing in nature. 

A "non-native polypeptide" " refers to a polypeptide 
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that is produced in a host differing from the one wherein 
it is produced in nature. 
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1. A polynucleotide coding for human lactoferrin, 
characterized in that it has a sequence totally or 
partially corresponding to the sequence reported in the 
annexed sequence listing as SEQ ID NO:l, said sequence 
being optimized for the in plant expression. 

2. The polynucleotide according to claim 1, wherein 
said polynucleotide has fused at the 5' -terminus the 
leader sequence of 7s basic globulin gene or the leader 
sequence of p-conglycinine gene 

3. The polynucleotide according to claim 2, wherein 
said leader sequence is the sequence reported in the 
annexed sequence listing as SSQ ID NO: 13 or the sequence 
reported in the annexed sequence listing as SEQ ID NO: 
14, 

4. A recombinant SNA vector conpriaing the 
polynucleotide according to any of claims 1 to 3 
operatively linked to regulation elements allowing the 
e^^ression of said polynucleotide - 

5. The recombinant DNA vector according to claim 4, 
wherein said regulacion elements is a plant expression 
cassette allowing the tissue specific expression of said 
polynucleotide. 

6 . The vector according to claim 5 , wherein said 
plant expression cassette includes the promoter of the 
gene coding for the protein basic globulin 7 S. 

7. The vector according to claim S, wherein said 
promoter has the sequence reported in the annexed 
sequence listing as SEQ ID N0i21 

8 . The vector according to claim € or 7, wherein 
said plant expression cassette includes the leader 
sequence of the gene coding for the protein basic 
globulin 7 S. 

3 . The vector according to claim a , wherein said 
leader sequence is the sequence reported in the annexed 
sequence listing as SEQ ID NO: 13. 
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10 The vector according to claim 5, wherein said 
plant expression cassette includes the promoter of the 
gene coding for |3-conglycinine protein. 

11 The vector according to claim 10, wherein said ■ 
promoter has the sequence reported in the annexed 
sequence listing as SEQ ID NOi22. 

12. The vector according to claim lO or 11, wherein 
said plant expression cassette includes the leader 
sequence of the gene coding for the p-conglycinine 

protein. , 

13 The vector according to claim 12, wherein said 

leader sequence is the sequence reported in the annexed 
sequence listing as SEQ ID NO: 14. 

14. A vegetal cell including the polynucleotxde 

according to any of claims 1 to 3 . ^ . ^ 

15. A vegetal cell including the vector according to 

any one of the claims 4 to 13 . 

IS. A cellular aggregation obtainable from cells 

according to claim 14 or IS. _ 

17 The cellular aggregation according to claim 16, 
wherein said aggregations are calluses capable of 
regenerating transgenic plants. 

- 18. A transgenic plant including in a tissue cell 
the polynucleotide according to any of claims 1 to 3. 

19. The transgenic plant according to claim 18, 
wherein said tissue cell is a storage tisst^e cell. 

20. The transgenic plant according to claim 18, 
wherein said tissue cell is a fruit tissue cell. ^ 

21 The transgenic plant according to any of claims 
IBsto 20, said plants being selected from the group 
consisting of solanaceae, cereals, leguminosae, fruit 
bearing plants and horticultural plants. 

22 The transgenic plant according to claim 21, said 
plant being selected from the group consisting of Soya, 

tobacco and rice. 

23. use of the vector according to any of claims 4 
to 13 for the trjana format ion of vegetal cells. 
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24 use of the transgenic plant according to any one 
of the olaima 18 to 22. for the production of 

natriceuticals . 

25 use oE the transgenic plant according to any of 
5 claims 18 to 22, for the production of human lactofemn. 

2S use of the transgenic plant according to any one 
of claim 18 to 22, for the production of lactoferrin 
flours or of lactoferrin e:c,:racts obtained from tissues 
of said transgenic plant. 
,0 27 use of the transgenic plant according to any one 

' of the claims 18 to 22 for the production of functional 
foods containing lactoferrin. 

28 The use according to claim 21, wherein saxd 
functional food is selected from the group consisting of 
15 vegetal milks, fruit juices, fruit and/or vegetable 

homogenized foods. n -sue 

29. A plant expression cassette allowing "^^e tissue 
specific expression of a gene of interest -'^--^'^^Jf 
promoter of the gene coding for the protexn basic 
.0 globulin T^S.^^^^ ^^^^^^^^ ^^^^^^^^ 

29, Wherein said promoter has the sequence reported m 
the annexed sequence listing as SEQ ID NO =21 

31. The plant e=<pression cassette according to claim 
25 29 or 30, wheredn said plant e=cpression cassette includes 
the lead:er se<juence of the gene coding for the protein 

""^r°Si"it-e:.ression cassette according to claim 
31, Wherein saiJd leader secjuence is the sequence reported 

30 as SEQ ID KO; 13- ^- 
33 A recombinant DSDV vector comprising a gene of 
interest under the control of the plant expression 
cassette according co any of claims 29 to 32 . 

IT ?he vector according to claim 33 when depending 
35 on claim 31 or 02, wherein said gene of interest is fused 
to the leader sequence. 
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03^1-2000 / ^^^.'-;^^:^:'\^y.:f:^^^^^ ; 

3s. ^ plant :':„:t;tnUetrs^^^^^^ 

specific -P---- for the p-=o^ly=i-- 

protnoter of tae g« 

protein. cassette according to claim 

3S. The plant ^^^^^^ ,.^ence reported in 

35, wherein said promoter loa 

t.e annexed se^^nce ^^^^f J^/.S,,, ...ordina to cXai. 

3,. The plant ^^/"^^^f ^;,,,3ion cassette includes 
3S or 36, Wherein saxd pl-t ^P^ ^^^^^ ,^,^3, 

10 the leader ^e^^; ^in. for the p-conglycinxne 
sequence o£ tne a 

g protein. cassette according to claim 

a 38. The plant e^cpression casse ^^p^rted 

- leader sequence is the ssh 

* 37, ,*erein said leaaer 

i IS as SEQ ID N0= 14- comprising a gene of 

fU 39. A recombinant ^ plant expression 

interest under the c^.r^ ot j^^^ ^ 33. 
p cassette according to 33 ^en depending 

I «. The vector interest is fused 

5 20 on claim 37 or 33, wherein said gen 

g „ th. leader science ^^^^^^ ..cording to 

41. A vegetal celJ- m 

°::.irii:ira4Ut.on o^talnahle .rom the cell 

25 according to claim 41. according to claim 42, 

43. The cellular ^^^regat-^n regenerating 
said aggregations being calluses 
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transgenic plants. including in a tissue cell 

A transgenic J 33, 34, 39 or 40. 

vector according to any o^ <= ^^^.^ ,i,i„ 44, 

45. The transgenic ^^^^^e cell- 

therein said tissue cell ^= ^ raccording to claim 44, 

4S. The transgenic P^^'^,^ ,i3sue cell, 
wherein said tissue cells is a ^^^^.^^ ^f claim 

47. The transgenic plant ^j,^ group 

,o 45, said plants being selec 
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43 to 45, 
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consisting of solanaceae, cereals, legundnosae, fruit 
bearing plants and horticultxiral plants. 

48. The transgenic plant according to claim 47, said 
plant being selected from the group consisting of Soya, 

5 tobacco and rice. 

49. Use of- the vectors according to claims 33, 34, 
39 or 40 for th4 transformation of vegetal cells. 
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Accordingly, the amendments involved are those filed with the application papers or, 
in the case of a supplemental declaration, are those amendments claiming matter not 
encompassed in the original statement of invention or claims. See 37 CFR 1.67. 

( c ) [ X ] was described and claimed in 

POT International Application No. PCT/IT99/002g6 

filed on 19 July 1999 as amended under PCT Article 19 (1) 

on (if any). 
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ACKNOWLEDGMENT OF REVIEW OF PAPERS AND DUTY OF CANDOR 

* 

I hereby state that I have reviewed and understand the contents of the above identified 
specification, Including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined 
in Title 37, Code Federal Regulations § 1.56. 

[ ] In compliance with this duty there is attached an information disclosure statement 
37 CFR 1 .97. 

PRIORITY CLAIM 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119 of any 
foreign applicatlon(s) for patent or inventor's certificate or of any PCT international 
application(s) designating at least one country other than the United States of America listed 
below and have also identified below any foreign applications(s) for patent or inventor's 
certificate or any PCT international application(s) designating at least one country other 
than the United States of America filed by me on the same subject matter having a filing date 
before that of the appllcatlon{s) of which priority is claimed. 

(complete (d) or (e)) 

(d) [ ] no such applications have been filed. 

( e ) [X] such applications have been filed as follows. 

NOTE: Where item (c) is entered above and the International Application which designated 
the U.S. claimed priority check Item (e), enter the details below and make the 
priority claim. 



EARLIEST FOREIGN APPLICATION(S), IF ANY, FILED WITHIN 12 MONTHS 
(6 MONTHS FOR DESIGN(S)) PRIOR TO THIS U.S. APPLICATION 



COUNTRY 


APPLICATION NUMBER 


DATE OF FlUNG 

{day, month, year) 


PRIORITY CLAIMED 
UNDER 37 use 119 


ITALY 


RM98A0Q0478 


17 July 1998 


[ X ] YES [ ] NO 








[ ] YES [ ] NO 








[ ]YES [ ]NO 








[ ]YES ( ]N0 








[ ]YES [ ]N0 



ALL FOREIGN APPLICATrON(S), IF ANY FILED MORE THAN 12 MONTHS 
(6 MONTHS FOR DESIGN(S)) PRIOR TO THIS U.S. APPLICATION 
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POWER OF ATTORNEY 



As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) lo prosecute this 
application and transact all business In the Patent and Trademark Office connected therewith. (List 
name and registration number) 



Richard P. Berg, Reg. No. 28,145 
Mavis S. Gallenson, Reg. No. 32,464 
Kam C. Louie, Reg. No. 33,008 
Ross A, Schmitt, Reg. No. 42,529 



Victor Repkin, Reg. No. 45,039 
John Palmer, Reg. No. 36,885 
Peter D, Galloway, Reg. No. 27, 885 
William R. Evans, Reg. No. 25, 858 



(checl< the following item, if applicable) 
[ ] Attached as part of this declaration and power of attorney is the authorization of the 
above-named attorney(s) to accept and follow instructions from my representative(s). 



SEND CORRESPONDENCE TO: DIRECT TELEPHONE GALLS TO: 

(Name and telephone number) 

Richard P. Berg, Esq. 

do LADAS & PARRY Richard P. Berg 

5670 Wilshire Boulevard, Suite 2100 

Los Angeles, California 90036-5679 (323) 934-2300 



DECLARATION 

1 hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent Issued thereon. 

SIGNATURE(S) 

Full name of sole or first inventor Corrado FOGHER 

Inventor's signature 

Date Country of Citizenship ITALY 

Residence ViaTicino. 32. 1-26041 Casafmagaiore. ITALY 

Post Office Address (same as residence) 

Full name of second Joint Inventor, if any 

Inventor's signature 

Date Country of Citizenship 

Residence 

Post Office Address 
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CHECK PROPER BOX(ES) FOR ANY OF THE FOLLOWING ADDED PAGES(S) 
WHICH FORM A PART OF THIS DECLARATION 



[ ] Signature for third and subsequent joint inventors. Number of pages added 

[ ] Signature by administratQr(trix}, executor(trix) or legal representative for deceased or 
incapacitated inventor. Number of pages added 

[ ] Signature for inventor who refuses to sign or cannot be reached by person authorized 

under 37 CFR 1 .47. Number of pages added Added pages to combined declaration and 
power of attorney for divisional, continuation-in-part (CIP) application. 
Number of pages added 



[ ] Authorization of attorney(s) to accept and follow instructions from representative. 

* * 



// no furifier pages form a part of this 
this page arid check the following item. 

[ X JThis declaration ends with this page. 



Declaration then end this Declaration with 
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PATENT 

Attorney's Docket No, B-4Q75PCT 618484-4 



COMBINED DECLARATION AND POWER OF ATTORNEY 

(ORIGINAL, DESIGN. NATIONAL STAGE OF PCT. SUPPLEMENTAL. DIVISIONAL. CONTINUATION, OR CIP) 



As a below named inventor, i hereby declare that: 

TYPE OF DECLARATION 

This declaration is of the following type: (check one applicable item below) 
[X] original 
[ ] design 
[ ] supplemental 

NOTE: If the declaration is for an International Application being filed as a divisional, continuation or 
continuation-in-part application, do not check next item; check appropriate one of last ttirce items. 

[X] national stage of PCT 
NOTE: If one of the following 3 items apply, then complete and also attach ADDED PAGES FOR 
DIVISIONAL, CONTINUATION, OR CIP. 

[ ] divisional 

[ ] continuation 

[ 1 continuation-in-part (CIP) 



■as- 



^ INVENTORSHIP IDENTIFICATION 

"Si WARNING- If the inventors are each not the inventors of all the claims an explanation of the facts, 
:=£ including the ownership of all the claims at the time the last claimed invention was 

t. ! made, should be submitted. 

My residence, post office address and citizenship are as stated below/ next to my name. I believe I am 
fU the original first and sole inventor (if only one name is listed below) or an original, first and joint inventor 

W (if plural names are listed below) of the subject matter which is claimed and for which a patent is sought 

:s on the invention entitled: 

TITLE OF INVENTION 



A SYNTHETIC POLYNUCLEOTIDE CODING FOR HUMAN LACTOFERRIN, VECTORS, CELLS AND 
TRANSGENIC PLANTS CONTAINING IT . 



SPECIFICATION IDENTIFICATION 

the specification of which: (complete (a), (b) or (c)) 

(a) [ ] is attached hereto. 

(b) [...] was filed on as [ ] Serial No. 



or [ ] Express Mail No., as Serial No. not yet known. 

and was amended on . (if applicable). 

NOTE- Amendments filed after the original papers are deposited with the PTO which contain new 

matter are not accorded a filing date by being referred to in the declaration. Accordingly, the 
amendments involved are those filed with the application papers or, in the case of a 
supplemental declaration, are those amendments claiming matter not encompassed In the 
original statement of invention or claims. See 37 CFR 1.67. 

(c) [X 1 was described and claimed in PCT International Application No. 

PCT/1T99/0D226 . ^. , 

filed on 19 July 1999 as amended under PCT Article 19 (1) 
on (if any) and identified as U.S. Serial No. 09/71-3,823 
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ACKNOWLEDGM KNT OF REVIEW OF.PAPERS AND DUTY OF CANDOR 



I hereby state that I have reviewed and understand the contents of the above identified specification, 
including the claims, as annended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in Title 37, 
Code Federal Regulations § 1.56. 

[ ] In compliance with this duty there is attached an infomiation disclosure statement 37 
CFR 1.97. 

PRIORITY CLAIM 

1 hereby claim foreign priority benefits jnder Title 35, United States Code, § 119 of any foreign 
application (s) for patent or inventor's cerUficate or of any PCX international application{s) designating at 
least one country other than the United States of America listed below and have also identified below 
any foreign applications(s) for patent or inventor's certificate or any PCT international application(s) 
designating at least one country other than the United States of America filed by me on the same 
subject matter having a filing date before that of the application (s) of which priority is claimed. 

(complete (d) or (e)) 

(d) [ 1 no such applications have been filed. 

(e) [X] such applications have been filed as follows. 

NOTE: Where item (c) is entered above ar}d ttie International Application which designated the U.S. 
claimed priority check item (e), enter the details below and make the priority claim. 



EARLIEST FOREIGN APPLICATION(S), IF ANY, FILED WITHIN 12 MONTHS 
(6 MONTHS FOR DESIGN(S)) PRIOR TO THIS U.S. APPLICATION 



COUNTRY 


APPLICATION NUMBER 


DATE OF FILING 


PRIORITY CLAIMED 




(day, month, year) 


UNDER 37 use 119 


ITALY 


RM98A000478 


17 July 1998 


[x]YES []N0 








[lYES []N0 








[]YES [] NO 








[ ] YES [ ] NO 








[]YES []NO 



ALL FOREIGN APP LIGATION (S), IF ANY FILED MORE THAN 12 MONTHS 
(6 MONTHS FOR DESIGN(S)) PRIOR TO THIS U.S. APPLICATION 
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CHECK PROPER BOX(ES) FOR ANY OF THE FOLLOWING ADDED PAGES(S) 
WHICH FORM A PART OF THIS DECLARATION 

[ ] Signature for third and subsequent joint inventors. Number of pages aclded_ 

[ ] Signature by administrator(trix), executor(trix) or legal representative for deceased or 
incapacitated inventor. Number of pages added _____ 

[ ] Signature for inventor who refuses to sign or cannot be reached by person authorized 
under 37 CFR 1 .47. Number of pages added Added pages to combined declaration and power of 
attorney for divisional, continuation-in-part (CIP) application. 
Number of pages added _ 



[ ] Authorization of attorney(s) to accept and follow instructions from representative. 



If no furtfjer pages form a part of this Declaration then end this Declaration with this page and 
check the following item. 

[X] This declaration ends with this page. 
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POWER OF ATTORNEY 



As a named inventor, 1 hereby appoint the following attorney(s) and/or agent(s) to prosecute tliis 
application and transact all business in the Patent and Trademarit Office connected therewith. (List 
name and registration number) 

Richard P. Berg. Reg. No. 28,1_45 Victor Repkin, Reg. No^ 45,p3£ 

Mavis S. Gallenson, Reg. No. 3242;^ John Palmer, Reg. No.J36j8_85 
Kam C. Louie, Reg. No. 3ajm Peter D. Galloway. Reg. No. 2L_885 

William C. Boling, Reg. No. 41,fi^ William R. Evans, Reg. No. 25, 858 
Ross A. Schmitt, Reg. No. ^.^^ 

(check the following item, if applicable) 
[ ] Attached as part of this dedaration and power of attorney Is the authorization of the above- 
named attorney(s) to accept and follow instructions from my representative(s). 



SEND CORRESPONDENCE TO: DIRECT TELEPHONE CALLS TO: 

(Name and telephone number) 

Richand P. Berg, Esq. 

c/o LADAS & PARRY Richard P. Berg 

5670 Wilshire Boulevard, Suite 2100 

Los Angeles. California 90036-5679 (323) 934-2300 



DECLARATION 

I hereby declare that all statements made herein of my own knowledge are true and that all statemente 
made on information and belief are believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1 001 of Title 1 8 of the United States Code, and that such willful 
false statements may jeopardize the validity of the application or any patent issued thereon. 

SIGNATURE(S) 

Full name of sole or first inventor Corrado FOGHER 

Inventor's signature — t'TT" ^ • 

Date g. dfcftCtI 200*1 Cergntry of Ciazenship_lIALY 

Residence Casalmaqaiore CR. ITALY ^J...XT~^ _ ^ 

Post nffir.fi Address Via Ticino 32 26041 'Casafmagaiore OR ITALY 



Full name of second joint inventor, if any 
Inventor's signature . 



Date Country of Citizenship, 

Residence 

Pnfit Office Address _ 



Full name of third joint inventor, if any 

Inventor's signature 

Date Country of Citizenship 

Residence 

Post Office Address 
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09/743823 

WOOO/04146 ^ / j6 PCT/IT99/00226 - 

JC07Rec'dPCT/PTO 1 6 JAN ffl 

SEQUENCE LISTING 
GENERAL INFORMATION: 
(x) APPLICANT: PLANTECHNO S.R.L. 
(ii) TITLE OF INVENTION: 

SYNTHETIC POLYNUCLEOTIDE ENCODING HUMAN LACTOFERRIN, 
VECTORS, CELLS AND TRANSGENIC PLANTS CONTAINING IT 
(iii) NUMBER OF SEQUENCES: 22 
{iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Societa Italiana Brevetti 

(B) STREET: .Piazza di Pietra, 39 

(C) CITY: Roma 

(D) COUNTRY: Italy 

(E) POSTAL CODE: 1-0018 6 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS Rev. 5.0 

(D) SOFTWARE: Microsoft Word 6.0 

[viii) ATTORNEY INFORMATION 

(A) NAME: LEONE, Mario (Eng.) 

(B) REFERENCE: PC-EBR 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 06/695441 

(B) TELEFAX: 06/69544830 

(C) TELEX: 612287 ROPAT 
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(1) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2079 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURES 

(A) NAME: LFUSYN 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGC CGT AGG AGA AGG AGT GTT CAA TGG TGC GCA GTA TCA CAA CCA 45 
G R R RRSVQWCAVSQ P 

GAG GCC ACA AAA TGC TTC CAA TGG CAA AGG AAT ATG AGA AAA GTT 90 
EATKCFQWQRNMRKV 

CGT GGA CCT CCT GTA TCT TGC ATA AAG AGA GAT TCA CCC ATC CAG 135 
RG PPVSCIKRDSPIQ 

TGT ATC CAG GCA ATT GCG GAA AAC AGA GCT GAT GCT GTG ACT CTT 180 
C I Q AIAENRADAVT L 

GAT GGT GGT TTC ATA TAC GAG GCA GGA CTT GCC CCA TAC AAA CTG 225 
DGGFIYEAGLA-PYKL 

CGA CCT GTA GCG GCG GAA GTC TAC GGG ACC GAA AGA CAA CCA CGA 270 
R P V AAEVYGTE RQ P R 

ACT CAC TAT TAT GCT GTG GCT GTT GTG AAG AAG GGC GGA TCT TTT 315 
TH YYA7AVVKKGGSF 

CAG CTG AAC GAA CTT CAA GGT CTG AAG TCA TGC CAC ACA GGA CTT 360 
QLN ELQGLKSCHTGL 

CGC AGG ACC GCT GGA TGG AAT GTC CCT ATA GGG ACA CTT CGT CCA 405 
RR T AGWNVPIGTLRP 
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TTC TTG AAT TGG ACG GGT CCA CCT GAG CCC ATT GAG GCA GCT GTG 4 50 
FLNWTGPPEP I EAAV 



GCA AGA TTC TTC TCA GCC TCT TGT GTT CCA GGT GCA GAT AAA GGA 4 95 
A. RFFSAS CVPGA D KG 



CAS. TTC CCC AAC CTT TGT CGC CTG TGT GCG GGG ACA GGG GAA AAC 540 
QFPNLCRLCAGT G EN 

AAA TGT GCA TTC TCA TCC CAG GAA CCG TAC TTC AGO TAG TCT GGT 585 
KCAFSSQEPYFSYSG 



GCC TTT AAG TGT CTT AGA GAC GGT GCT GGA GAT GTT GCT TTT ATT 630 
AFKCLRDGAGDVAFI 

AGA GAG AGC ACA GTG TTT GAG GAT CTT TCA GAC GAG GCT GAA AGG 675 
RESTVFEDLSDEAER 



GAC GAG TAT GAG TTA CTC TGC CCA GAC AAC ACT CGT AAG CCA GTT 720 
DEYELLCPDNTRKPV 



GAC AAG TTC AAA GAT TGC CAT CTT 
DKFKDCHL 

GTT GTG GCA CGA AGT GTT AAT GGA 
VVARSVNG 

CTT CTC CGC CAA GCA CAG GAA AAG 
LLRQAQEK 

AAA TTC CAG CTC TTT GGT TCC CCT 
KFQLFGSP 

TTC AAG GAC TCT GCC ATT GGG TTT 
FKDSAIGF 



GCA CGG GTC CCT TCT CAT GCC 765 
A R V P S H A 

AAG GAG GAT GCC ATC TGG AAT 810 
K E D A I W N 

TTT GGA AAG GAC AAG TCA CCG 855 
F G K D K S P 

AGT GGG CAG AAA GAT CTT CTG 900 
S G Q K D L L 

TCG AGA GTG CCA CCT AGG ATA 945 
S R V P P R I 
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GAT TCT GGG TTG TAG CTT GGC TCC GGA TAC TTT ACT GCA ATT CAG 990 
DSGLYLGSGYFTAIQ 

AAC TTG AGG AAA AGT GAG GAG GAA GTT GOT GCC CGG CGT GCG GGG 1035 
N L R K SEE EVAARRAR 

GTC GTT TGG TGT GCG GTG GGA GAG CAA GAG TTG CGC AAG TGT AAC 1080 
VVWCAVGEQELRKCN 

CAG TGG AGT GGT TTG AGO GAA GGA TCT GTG ACC TGG TCA TCG GCC 1125 
QWS GLSEGSVTCSSA 

TCC ACT ACA GAA GAT TGC ATC GCC CTG GTG TTG AAA GGA GAA GCT 1170 
STTEDCIALVLKGEA 

GAT GCC ATG AGT TTG GAT GGA GGA TAT GTT TAC ACT GCA GGT AAA 1215 
.DAMSLDGGYVYTAGK. 

TGT GGT TTG GTG CCT GTC CTT GCA GAG AAC TAC AAA TCA CAA CAA 1260 
CGLVPVLAENYKSQQ 

AGC AGT GAC OCT GAT CCT AAC TGT GTG GAT AGA CCT GTG GAA GGA 1305 
SSDPDPNCVDRPVEG 

TAT CTT GCT GTG GCG GTG GTT AGG AGA TCA GAC* ACT AGC CTT ACC 1350 
YL AVAVVRRS DTSLT 

TGG AAC TCT GTG AAA GGC AAG AAG TCC TGC CAC ACC GCC GTG GAC 1395 
HNSVKGKKSCHTAVD 

AGG ACT GCA GGT TGG AAT ATC CCC ATG GGA TTG CTG TTC AAC CAG 1440 
RTAGWNIP MGLLFNQ 

ACG GGC TCC TGC AAA TTT GAT GAA TAT TTC AGT CAA AGC TGT GCC 1485 
TGSCKFDEYFSQSCA 

CCT GGT TCT GAC CCA AGA TCT AAT CTC TGT GCT TTG TGT ATT GGA 1530 
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PG SDPRSN LCALCI G 

GAT GAG CAA GGT GAG AAT AAG TGC GTT CCC AAC AGC AAC GAG AGA 1575 
D EOGE NKCV P NS N E R 

TAC TAG GGT TAG ACT GGG GCT TTC CGT TGC TTG GGT GAG AAT GOT 1620 
YYGYTGAFRCLAEMA 

GGA GAC GTT GCA TTT GTG AAA GAT GTC ACT GTC TTG GAG AAC ACT 1665 
GDVAFVKDVTVLQN T 

GAT GGA AAT AAC AAT GAG GCA TGG GCT AAG GAT TTG AAG CTT GCA 1710 
DGNNNEAWAKDL KLA 

GAC TTT GCG TTG CTG TGC CTC GAT GGC AAA CGT AAG CCT GTG ACT 1755 
DFALLCLDGKRKPVT 

GAA GCT AGA AGC TGC CAT CTT GCC ATG GCC CCG AAT CAT GCT GTG 1800 
EARSCHLAMAPN HAV 

GTG TCT CGT ATG GAT AAG GTG GAA CGC TTG AAA CAG GTG TTG CTC 1845 
VSRMDKVERLKQVLL 

CAC CAA CAG GCT AAA TTT GGT AGA AAT GGA TCT GAC TGC CCG GAC 1890 
HQQAKFGRNGSDC P D 

AAG TTT TGC TTA TTC CAG TCT GAA ACC AAA AAC CTT TTG TTC AAT 1935 
K F CLFQSE TKN L LF N 

GAC AAC ACT GAG TGT CTT GCC AGA CTC CAT GGC AAA AGA ACA TAT 1980 
DNTECLARLHGKT TY 

GAA AAA TAT TTG GGA CCA CAG TAT GTC GCA GGC ATT ACT AAT CTG 2025 
EKYLGPQYVAGI TNL 

AAA AAG TGC TCA ACC TCC CCA CTC CTA GAA GCC TGT GAA TTC CTA 2070 
KKCSTSPLLEACE FL 
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AGG AAG TAA 
R K * 



2079 



(2} INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: oligonucleotide 
( ix ) FEATURES 

{A} NAME: PLT 46 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGATCCATGG GCCGTAGGAG AAGGAGTGTT 30 

(3) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 32 bp 

{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 47 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAGCTCCTTC GGTTTTACTT CCTGAGGAAT TC 32 

(4) INFORMATION FOR SEQ ID NO: 4: 
(X) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 42 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 
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(A) NAME: PLT 4 8 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TCTAGATAAA ATAATCTATA CATTAAAAAA TTTGATTTTA AA 42 



(5) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 36 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 4 9 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCGACT GAGTCGGATA AGAAGAAAAG AAAAGA 36 

(6) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 36 bp 

CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 50 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TCTAGAGTTT TCAAATTTGA ATTTTAATGT GTGTTG 36 

(7) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 36 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 51 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGATCCCACC TTAAGGAGGT TGCAACGAGC GTGGCA 36 

(8) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 250 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 120 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGC CGT AGG AGA AGG AGT GTT CAA TGG TGC GCA GTA TCA CAA CCA GAG 
GCC ACA AAA TGC TTC CAA TGG CAA AGG AAT ATG AGA AAA GTT CGT GGA 
CCT COT GTA TCT TGC ATA AAG AGA GAT TCA CCC ATC CAG TGT ATC CAG 
GCA ATT GCG GAA AAC AGA GCT GAT GOT GTG ACT CTT GAT GGT GGT TTC 
ATA TAC GAG GCA GGA CTT GCC CCA TAC AAA CTG CGA CCT GTA GCG GCG 
GAA GTC TAC G 

(9) INFORMATION FOR SEQ ID N0:9: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 250 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 121 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GC hCC TGG AAC ACA AGA GGC TGA GAA GAA TCT TGC CAC AGC TGC CTC 
AAT GGG CTC AGG TGG ACC CGT CCA ATT CAA GAA TGG ACG AAG TGT CCC 
TAT AGG GAC ATT CCA TGC AGC GGT CCT GCG AAG TCC TGT GTG GCA TGA 
CTT CAG ACC TTG AAG TTC GTT GAG CTG AAA AGA TCC GCC CTT CTT CAC 
AAC AGC CAC AGC ATA ATA GTG AGT TCG TGG TTG TCT TTC GGT CCC GTA 
GAC TTC CGC CG 

(10) INFORMATION FOR SEQ ID NO:10: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 250 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 122 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

AAC TGG CTT ACG AGT GTT GTC TGG GCA GAG TAA CTC ATA CTC GTC 
CCT TTC AGC CTC GTC TGA AAG ATC CTC AAA CAC TGT GCT CTC TCT 
AAT AAA AGC AAC ATC TCC AGC ACC GTC TCT AAG ACA CTT AAA GGC 
ACC AGA GTA GCT GAA GTA CGG TTC CTG GGA TGA GAA TGC ACA TTT 
GTT TTC CCC TGT CCC CGC ACA CAG GCG ACA AAG GTT GGG GAA TTG 
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TCC TTT ATC TGC ACC TGG AAC ACA A 

(11) INFORMATION FOR SEQ ID N0:11: 
(x) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 255 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
Ux) FEATURES 

(A) NAME: PLT 123 
{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTA CAA CCC AGA ATC TAT OCT AGG TGG CAC TCT CGA AAA CCC AAT GGC 
AGA GTC CTT GAA CAG AAG- ATC TTT CTG CCC ACT AGG GGA ACC AAA GAG 
CTG GAA TTT CGG TGA CTT GTC CTT TCC AAA CTT TTC CTG TGC TTG GCG 
GAG AAG ATT CCA GAT GGC ATC CTC CTT TCC ATT AAC ACT TCG TGC CAC 
AAC GGC ATG AGA AGG GAC CCG TGC AAG ATG GCA ATC TTT GAA CTT GTC 
AAC TGG CTT ACG AGT 

[12} INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 251 bp 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
Cix) FEATURES 

(A) NAME: PLT 124 
(xt) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
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TA TCC TCC ATC CAA ACT CAT GGC ATC AGC TTC TCC TTT CAA CAC CAG 
GGC GAT GCA ATC TTC TGT AGT GGA GGC CGA TGA GCA GGT CAC AGA TCC 
TTC GCT CAA ACC ACT CCA CTG GTT ACA CTT GCG CAA CTC TTG CTC TCC 
CAC CGC ACA CCA AAC GAC CCG CGC ACG CCG GGC AGC AAC TTC CTC CTC 
ACT TTT CCT CAA GTT CTG AAT TGC AGT AA-A GTA TCC GGA GCC AAG GTA 
CAA CCC AGA ATC 

(13) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 75 bp 
{B).TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(D) OTHER INFORMATION: leader sequence of protein 7S 
basic globulin 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
ATGGCTTCTA TCCTCCACTA CTTTTTAGCC CTCTCTCTTT CTTGCTCTTT 50 
TCTTTTCTTC TTATCCGACT CAGTC 75 

(14) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 191 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(D) OTHER INFORMATION: leader sequence of protein ji- 
conglycinine 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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ATGATGAGAG CGCGGTTCCC ATTACTGTTG CTGGGAGTTG TTTTCCTAGC 50 

ATCAGTTTCT GTCTCATTTG GCATTGCGTA TTGGGAAAAG CAGAACCCCA 100 

GTCACAACAA GTGCCTCCGA AGTTGCAATA GCGAGAAAGA CTCCTACAGG 150 

AACCAAGCAT GCCACGCTCG TTGCAACCTC CTTAAGGTG 189 

(15} INFORMATION FOR SEQ ID N0:15: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 250 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAGCAATCCC ATGGGGATAT TCCAACCTGC AGTCCTGTCC ACGGCGGTGT -50 
GGCAGGACTT CTTGCCTTTC ACAGAGTTCC AGGTAAGGCT AGTGTCTGAT 100 
CTCCTAACCA CCGCCACAGC AAGATATCCT TCCACAGGTC TATCCACACA 150 
GTTAGGATCA GGGTCACTGC TTTGTTGTGA TTTGTAGTTC TCTGCAAGAC 200 
AGGCACCAAA CCACATTTAC CTGCAGTGTA AACATATCCT CCATCCAAAC 250 

(16) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS 

{A) LENGTH: 254 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 126 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCATCAGTGT TCTGCAAGAC AGTGACATCT TTCACAAATG CAACGTCTCC 50 
AGCATTCTCA GCCAAGCAAC GGAAAGCCCC AGTGTAACCG TAGTATCTCT 100 
CGTTGCTGTT GGGAACGCAC TTATTCTCAC CTTGCTCATC TCCAATACAC 150 
AAAGCACAGA GATTAGATCT TGGGTCAGAA CCAGGGGCAC AGCTTTGACT 200 
GAAATATTCA TCAAATTTGC AGGAGCCCGT CTGGTTGAAG AGCAAGCCCA 250 
TGGG 254 



wo 00/04146 



13 / 16 



PCT/IT99/00226 



(17) INFORHATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 229 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 127 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCAGTCAGAT CCATTTCTAC CAAATTTAGC CTGTTGGTGG AGCAACACCT 50 
GTTTCAAGCG TTCCACCTTA TCCATAGGAG ACACCACAGC ATGATTCGGG 100 
GCCATGGCAA GATGGCAGCT TCTAGCTTCA GTCACAGGCT TACGTTTGCC 150 
ATCGAGGCAC AGCAACGCAA AGTCTGCAAG CTTCAAATCC TTAGCCCATG 200 
CCTCATTGTT ATTTCCATCA GTGTTCTGC 229 

(18) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 210 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 128 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTACTTCCTT AGGAATTCAC AGGCTTCTAG GAGTGGGGAG GTTGAGCACT 50 
TTTTCAGATT AGTAATGCCT GCGACATACT GTGGTCCCAA ATATTTTTCA 100 
TATGTTGTTT TGCCATGGAG TCTGGCAAGA CACTCAGTGT TGTCATTGAA 150 
CAAAAGGTTT TTGGTTTCAG ACTGGAATAA GCAAAACTTG TCCGGGCAGT 200 
CAGATCCATT 210 

(19) INFORMATION FOR SEQ ID N0:19: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 30- bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
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(ix) FEATURES 

(A) NAME: PLT 129 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGATCCATGG GCCGTAGGAG AAGGAGTGTT 30 

(20) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CfiARACTERISTICS 

(A) LENGTH: 28 bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: s.ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 
(ix) FEATURES 

(A) NAME: PLT 120 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GAGCTCTTAC TTCCTTAGGA ATTCACAG 28 

(21) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: bp 

CB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURES 

(D) OTHER INFORMATION: sequence of promoter of gene 
encoding 7S basic globulin 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

TAAAATAATC TATACATTAA AAAATTTGAT TTTAAAATTT TAGAAATTCA 50 
TGATTTTATT TTTTTTTACC AGAAATCCGT TAATATTGTT AAAATATTAC 100 
CAACTAATTT ATAAATTTTA TTTTAAGGCA ATTAAGCATG TTTGATAAAA 150 
TATATATATT GTTATAAATA CTTTTCAAAA GTATAAAGTT GATGATGGCG 200 
TGGTGGTAGA TTATTTTAGT TCTAGGTTCG AATGCAAGTT GGTTTAGACA 250 
TTTAGCCTTA TTCTTTTTTC TAACCAAAAT AAATGTAAAT GGAAAACCTT 300 
TAGGAAAAAA AAGAAATCAA AATTGAAAAC ATCATCCGGT GGAGTCGAGA 350 
AGCCCACACC CACGTGACCC AACAATATTA AAATAAGAGT TTGCTCTACA 400 
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GTAAATGCGA 


TAG TT TTTT A 


TTCAATAPTT 


TTTr'par'TTr* 

XXX ^i_iriv_ X X ^ 




4dU 


GAGATTTGCA 


CCGTTAACTA 


£^ X X ^TJT.O X W X X 




^n^^^n^TV H 7\ T\ T\ 




AACTTGTGTA 


CCGTGCCTCA 


fATTTCaaPT 




i ijAAACaCCGT 


c c 

550 


TATGTTTAGG 


TTAGTGTTTf^ 


fa Afaf^TTrza 




i CAGGAGGCT 


600 


ACTTGGTCTT 


w^w ± ^ X -L IJ^^VJ J, 


v,1XXXIj11 ^ a 


Ax i X 1 ifJACG 


TGATTTTGTT 


650 


GGTGAACACG 




Zi r* T T 71 TT B T I\ 




TTTTATAAGT 


0 

700 


TTCACTTCTT 


ATATAATACT 


pATaTaaTaT 


7\ Tn^/^f^TTTTV 


CsAATGCCAAT 


750 


TTTTAAAAAA 


AGAATAAAAA 


aaTaaaTapa 


7\ T a aa 71 Tr'pTi 




a UU 


TGTAAAAAAT 


TTGAGGGGGA 




n. X lartA/i^j X d 


Ax i Ai 1 1 AAA 


S50 


TTTTCCATTA 


GAATTCTATT 


TTPPTTanTT 

X X ^ ^-^ X X X 1 


a aTaT/**a rT'T 


AfjCCAtj i i GG 


yuo 


GAGATACACG 


a a a a Tr^Tr* a t 


ca 2i ar'ar'TTr' 


UAX Li i ALiCntjA 


AATTAATGTA 


950 




afzr"a ar'ar'a a 


J\ TV 7\ T/^r^TV 71 /^/^ 


CAAGCTAGCT 


GCTCACGCGA 


1000 


ACTCGATCrA 


V-'^V^VJ 1 V^v^ i. J. J, 


71 r*!!;^ ar'TTTr' 
A^AtsAij i i i O 


7\ 7\ 7\^f^/^7\Tf^7\ 

iiiitiUijOAl CaA 


AATCTGCATG 


1050 




aaafzr'aTTriT 

Arin,uLj.A J. X u! i. 


Tr" T r* B rr^ T/^ r* 


t-AAG I AC C CC 


TCACACTCAC 


1100 




1. 1. i. t. ± 




1 AAl- X CAAGT 


TTATCCTTTC 


1150 


TTTGCTTCTG 


GAAATTTCAC 


AAGCCTCAAA 


CACGTCGACG 


TCCAATCTTG 


.1200 


TGACCAACAC 


GGCCAAAAGA 


AAAGAGAATC 


TCATCCCGTT 


CACACTTAGC 


1250 


CACTTAAAGC 


TAGCCAAACG 


GTGATCTTTC 


TCTATATATT 


GTAGCTCTCT 


1300 


AACACAACCA 


ACACTACCAT 


TATTCAATAT 


TCAAACCTTG 


CTCTATACTA 


1350 


CACACACTAG 


AAGAATA 
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(22) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURES 

(D) OTHER INFORMATION: sequence of promoter of gene 
coding for p-conglycinine 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GTTTTCAAAT TTGAATTTTA ATGTGTGTTG TAAGTATAAA TTTAAAATAA 50 
AAATAAAAAC AATTATTATA TCAAAATGGC AAAAACATTT AATACGTATT 100 
ATTTATTAAA AAAATATGTA ATAATATATT TATATTTTAA TATCTATTCT 150 
TATGTATTTT TTAAAAATCT ATTATATATT GATCAACTAA AATATTTTTA 200 
TATCTACACT TATTTTGCAT TTTTATCAAT TTTCTTGCGT TTTTTGGCAT 250 
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ATTTAATAAT 


GACTATTCTT 


TAATAATCAA 


TCATTATTCT 


TACATGGTAC 


300 


ATATTGTTGG 


AACCATATGA 


AGTGTTCATT 


GCATTTGACT 


ATGTGGATAG 


350 


TGTTTTGATC 


CATGCCCTTC 


ATTTGCCGCT 


ATTAAXTAAT 


TTGGTAACAG 


400 


ATTCGTTCTA 


ATCAGTTACT 


TAATCCTTCC 


TCATCATAAT 


TAATCTGGTA 


450 


GTTCGAATGC 


CATAATATTG 


ATTAGTTTTT 


TGGACCATAA 


GAAAAAGCCA 


500 


AGG2^CAAAA 


GAAGACAAAA 


CACAATGAGA 


GTATCCTTTG 


CATAGCAATG 


550 


TCTAAGTTCA 


TAAAATTCAA 


ACAAAAACGC 


AATCACACAC 


AGTGGACATC 


600 


ACTTATCCAC 


TAGCTGATCA 


GGATCGCCGC 


GTCAAGAAAA 


AAAAACTGGA 


650 


CCCCAAAAGC 


CATGCACAAC 


AACACGTACT 


CACAAAGGCG 


TCAATCGAGC 


700 


AGCCCAAAAC 


ATTCACCAAC 


TCAACCCATC 


ATGAGCCCAC 


ACATTTGTTG 


750 


TTTCTAACCC 


AACCTCAAAC 


TCGTATTCTC 


TTCCGCCACC 


TCATTTTTGT 


800 


TTATTTCAAC 


ACCCGTCAAA 


CTGCATCCCA 


CCCCGTGGCG 


AAATGTTCAT 


850 


GCATGTTAAC 


AAGACCTATG 


ACTATAAATA 


TCTGCAATCT 


CGGCCCAAGT 


900 


TTTCATCATC 


AAGAACCAGT 


TCAATATCCT 


AGTACGCCGT 


ATTAAAGAAT 


950 


TTAAGATATA 


CT 
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